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A METHOD FOR GENERATING BIRNAVIRUS FROM 

SYNTHETIC RNA TRANSCRIPTS 

Background o f the Invention 

Infectious bursal disease virus (IBDV), a member of the Bimaviridae 

5 family, is the causative agent of a highly immunosuppressive disease in 
young chickens (Kibenge, F.S.B., etal., J. Gen. Virol., 69, 1757-1775 (1988)). 
Infectious bursal disease (IBD) or Gumboro disease is characterized by the 
destruction of lymphoid follicles in the bursa of Fabricius. In a fully 
susceptible chicken flock of 3-6 weeks of age the clinical disease causes 

10 severe immunosuppression, and is responsible for losses due to impaired 
growth, decreased feed efficiency, and death. Susceptible chickens less than 
3 weeks old do not exhibit outward clinical signs of the disease but have a 
marked infection characterized by gross lesions of the bursa. 

The virus associated with the symptoms of the disease is called 

1 5 infectious bursal disease virus (IBDV). IBDV is a pathogen of major economic 
importance to the nation and world's poultry industries. It causes severe 
immunodeficiency in young chickens by destruction of precursors of antibody- 
production B cells in the bursa of Fabricius. Immunosuppression causes 
increased susceptibility to other diseases, and interferes with effective 

20 vaccination against Newcastle disease, Marek's disease and infectious 

bronchitis disease viruses. 

There are two known serotypes of IBDV. Serotype I viruses are 
pathogenic to chickens whereas serotype II viruses infect chickens and 
turkeys. The infection of turkeys is presently of unknown clinical significance. 

25 IBDV belongs to a group of viruses called Bimaviridae which includes 

other bisegmented RNA viruses such as infectious pancreatic necrosis virus 
(fish), tellina virus and oyster virus (bivalve mollusks) and drosophila X virus 
(fruit fly). These viruses all contain high molecular weight (MW) double- 
stranded RNA genomes. 

30 The capsid of the IBDV virion consists of several structural proteins. 

As many as nine structural proteins have been reported but there is evidence 
that some of these may have a precursor-product relationship (Kibenge, 
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F.S.B., et al., J. Gen. Virol., 69, 1757-1775 (1988)). The designation and 
molecular weights of the viral proteins (VP) are as shown below. 



Viral Protein Molecular Weight 

5 ■ 



VP1 


90kDa 


VP2 


41 kDa 


VP3 


32 kDa 


VP4 


28 kDa 


VP 5 


17 kDa 



Two segments of double-stranded RNA were identified in the genome 
of IBDV. The IBDV genome consists of two segments of double-stranded 
(ds)RNA that vary between 2827 (segment B) to 3261 (segment A) nucleotide 

1 5 base pairs (Mundt, E. et al. , Virology, 209, 10-18(1 995)). The larger segment 
A encodes a polyprotein which is cleaved by autoproteolysis to form mature 
viral proteins VP2, VP3 and VP4 (Hudson, P J. et al., Nucleic Acids Res., 14, 
5001-5012 (1986)). VP2 and VP3 are the major structural proteins of the 
virion. VP2 is the major host-protective immunogen of IBDV, and contains the 

20 antigenic regions responsible for the induction of neutralizing antibodies 
(Azad, et al., Virology, 161 , 145-152 (1987)). A second open reading frame 
(ORF), preceding and partially overlapping the polyprotein gene, encodes a 
protein (VP5) of unknown function that is present in IBDV-infected cells 
(Mundt, E., et at., J. Gen. Virol., 76, 437-443, (1995)). The smaller segment 

25 B encodes VP1, a 90-kDa multifunctional protein with polymerase and 
capping enzyme activities (Spies, U., et al., Virus Res., 8, 127-140 (1987); 
Spies, U., et al., J. Gen. Virol., 71, 977-981 (1990)). 

It has been demonstrated that the VP2 protein is the major host 
protective immunogen of IBDV, and that it contains the antigenic region 

30 responsible for the induction of neutralizing antibodies. The region containing 
the neutralization site has been shown to be highly conformation-dependent. 
The VP3 protein has been considered to be a group-specific antigen because 
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it is recognized by monoclonal antibodies directed against it from strains of 
both serotype I and II viruses. The VP4 protein appears to be a virus-coded 
protease that is involved in the processing of a precursor polyprotein of the 
VP2, VP3 and VP4 proteins. 
5 Although the nucleotide sequences for genome segments A and B of 

various IBDV strains have been published, it was only recently that the 
complete 5 - and 3-noncoding sequences of both segments were determined. 
The 5-noncoding region of IBDV segments A and B contain a consensus 
sequence of 32 nucleotides, whereas the 3-noncoding terminal sequences 

10 of both segments are unrelated, but conserved among IBDV strains of the 
same serotype (Mundt, E. et at., Virology, 209, 10-18 (1995)). These terminii 
might contain sequences important in packaging and in the regulation of IBDV 
gene expression, as demonstrated for other dsRNA containing viruses such 
as mammalian and plant reoviruses, and rotaviruses (Anzola, et al., Proc. 

15 Natl. Acad. Sci. USA, 84, 8301-8305 (1987); Zou, S M et al., Virology, 186, 
377-388 (1992); Gorziglia, M.I., et al., Proc. Natl. Acad. Sci. USA, 89, 5784- 
5788 (1 992)). 

In recent years, a number of infectious animal RNA viruses have been 
generated from cloned cDNA using transcripts produced by DNA-dependent 

20 RNA polymerase (Boyer, J.C., et al., Virology, 198, 415-426 (1994)). For 
example poliovirus, a plus-stranded RNA virus; influenza virus, a segmented 
negative-stranded RNA virus; rabies virus, a non-segmented negative- 
stranded RNA virus; all were recovered from cloned cDNAs of their respective 
genomes (vander Werf, S., etal., Proc. Natl. Acad. Sci. USA, 83, 2330-2334 

25 (1986); Enami, M., et al., Proc. Natl. Acad. Sci. USA, 87, 3802-3805 (1990); 
Schnell, M.J., et al., EMBO J., 13, 4195-4205 (1994)). For reovirus, it was 
shown that transfection of cells with a combination of SSRNA, dsRNA and in 
vitro translated reovirus products generated infectious reovirus when 
complemented with a helper virus from a different serotype (Roner, M.R., et 

30 al., Virology, 179, 845-852 (1990)). However, to date, there has been no 
report of a recovered infectious virus of segmented dsRNA genome from 
synthetic RNAs only. 



Summary of the Invention 

This invention relates to the infectious bursal disease virus (IBDV) that 
is associated with Gumboro disease of young chickens. More particularly, 
this invention relates to a system for the generation of infectious bursal 
5 disease virus (IBDV) using synthetic transcripts derived from cloned cDNA. 
The present invention will facilitate studies of the regulation of viral gene 
expression, pathogenesis and design of a new generation of live and 

inactivated vaccines. 

Detailed Des cription of the invention 

10 In an effort to develop a reverse genetics system for IBDV, three 

independent full-length cDNA clones which contain segment A of serotype I 
strain D78 or serotype II strain 23/82 and segment B of the serotype I strain 
P2, respectively, were constructed. Synthetic RNAs of segments A and B 
were produced by in vitro transcription reaction on linearized plasmids with T7 

1 5 RNA polymerase. Transcripts of these segments, either untreated or treated 
with DNase or RNase, were evaluated for the generation of infectious virus 
by transfection of Vera cells. 

The present inventors have demonstrated that synthetic transcripts 
derived from cloned DNA corresponding to the entire genome of a segmented 

20 dsRNA animal virus can give rise to a replicating virus. The recovery of 
infectious virus after transfecting cells with synthetic plus-sense RNAs derived 
from cloned cDNA of a virus with a dsRNA genome (IBDV) completes the 
quest of generating reverse infectious systems for RNA viruses. A number 
of investigators have generated infectious animal RNA viruses from cloned 

25 cDNA (Boyer, J.C., et al., Virology, 198, 415-426 (1994)). Van der Werf et al. 
were first to generate poliovirus, a plus-stranded RNA virus, using synthetic 
RNA produced by T7 RNA polymerase on cloned cDNA template (van der 
Werf, S., et al., Proc. Natl. Acad. Sci. USA, 83, 2330-2334 (1986)). later, 
Enami et al. rescued influenza virus, a segmented negative-stranded RNA 

30 virus (Enami, M., et al., Proc. Natl. Acad. Sci. USA, 87, 3802-3805 (1990)); 
and Schnell et al. generated rabies vims, a non-segmented negative-stranded 
RNA virus, from cloned cDNAs of their respective genomes (Schnell, M.J., et 
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al., EM80 J., 13, 4195-4205 (1994)). Roner et al. developed an infectious 
system for a segmented dsRNA reovirus by transfecting cells with a 
combination of synthetic ssRNA, dsRNA, in vitro translated reovirus products, 
and complemented with a helper virus of different serotype (Roner, M.R., et 
5 al., Virology, 179, 845-852 (1990)). The resulting virus was discriminated 
from the helper virus by plaque assay. However, in this system the use of a 
helper virus was necessary. In contrast, the presently described reverse 
genetics system of IBDV does not require a helper virus or other viral 
proteins. Transfection of cells with plus-sense RNAs of both segments was 

1 0 sufficient to generate infectious virus (IBDV). The fate of the additional one 
or four nucleotides, respectively, transcribed at the 3'-end of segment A was 
not determined. However, this did not prevent the replication of the viral 
dsRNA. Similar effects were observed for plus-stranded RNA viruses by 
different investigators (Boyer, J.C., et al., Virology, 198, 415-426 (1994)). 

15 Transfection of plus-sense RNAs of both segments into the same cell 

was necessary for the successful recovery of IBDV. Transfected RNAs of 
both segments had to be translated by the cellular translation machinery. The 
polyprotein of segment A was presumably processed into VP2, VP3 and VP4 
proteins which form the viral capsid. The translated protein VP1 of segment 

20 B probably acted as a RNA-dependent RNA polymerase and transcribed 
minus-strands from synthetic plus-strands of both segments, and the reaction 
products formed dsRNA. Recently, Dobos reported that in vitro transcription 
by the virion RNA-dependent RNA polymerase of infectious pancreatic 
necrosis virus (IPNV), a prototype virus of the Birnaviridae family, is primed 

25 by VP1 and then proceeds via an asymmetric, semiconservative, strand- 
displacement mechanism to synthesize only plus strands during replication 
of the viral genome (Dobos, P., Virology, 208, 10-25 (1995)). The present 
system shows that synthesis of minus-strands proceeds on the plus-strands. 
Whether the resulting transcribed minus-strand RNA serves as a template for 

30 the transcription of plus-strands or not remains the subject of further 
investigation. 



To prove that the infectious IBDV contained in the supematants of 
transfected cells was indeed derived from the synthetic transcripts, an artificial 
chimera was generated containing segment A of a serotype II strain and 
segment B of a serotype I strain. Sequence analysis verified this genome 
5 combination. The results also indicate that the terminal sequence motifs 
described by Mundt and Mailer are probably responsible for replication, 
sorting and packaging of the viral genome (Mundt, E. et al., Virology, 209, 1 0- 
18(1 995)). Presence of serotype-specific terminal sequences obviously does 
not prevent proper replication of serotype II A segment by the action of the 

10 RNA-dependent RNA polymerase VP1 of the serotype I segment B. The 
ability to create recombinant viruses will greatly help in analyzing the precise 
function of serotype-specific and serotype-common terminal sequences. 

The recovery of infectious IBDV demonstrates that only the plus-strand 
RNAs of both segments are sufficient to initiate replication of dsRNA. Thus, 

15 the results are in agreement with the general features of reovirus and 
rotavirus replication where the plus-strand RNAs serve as a template for the 
synthesis of progeny minus-strands to yield dsRNA (Schonberg, M., et al., 
Proc. Natl. Acad. Sci. Patton, J.T., Vims Res., 6, 217-233 (1986); Chen, D., 
et al., J. Virol., 68, 7030-7039 (1994)). However, the semiconservative, 

20 strand displacement mechanisms proposed by Spies et al. and Dobos could 
not be excluded (Spies, U., et al., Virus Res., 8, 127-140 (1987); Dobos, P., 
Virology, 208, 10-25 (1995)). The development of a reverse genetics system 
for IBDV will greatly facilitate future studies of gene expression, pathogenesis, 
and help in the design of new generations of live and inactivated IBDV 

25 vaccines. 

As used in the present application, the term "synthetic" as applied to 
nucleic acids indicates that it is a man made nucleic acid in contrast to a 
naturally occurring nucleic acid. The term implies no limitation as to the 
method of manufacture, which can be chemical or biological as long as the 

30 method of manufacture involves the intervention of man. 

The term "cDNA" is intended to encompass any cDNA containing 
segments A and B and the 5* and 3' noncoding regions of segments A and B. 



The term "infectious" as applied to viruses indicates that the virus has 
the ability to reproduce. The virus can be pathogenic or nonpathogenic and 
still be infectious. 

The present invention provides a system for the generation of 

5 infectious bursal disease virus using synthetic RNA transcripts. This system 
can be used to study the regulation of viral gene expression, pathogenesis, 
and for the design of a new generation of live and inactivated IBDV vaccines. 

The present invention provides a recombinant vector containing at 
least one copy of the cDNA according to the present invention. The 

1 0 recombinant vector may also comprise other necessary sequences such as 
expression control sequences, markers, amplifying genes, signal sequences, 
promoters, and the like, as is known in the art. Useful vectors for this purpose 
are plasmids, and viruses such as baculoviruses, herpes virus (HVT) and pox 
viruses, e.g., fowl pox virus, and the like. 

15 Also provided herein is a host cell transformed with the recombinant 

vector of the present invention or a host cell transfected with the synthetic 
RNA of the present invention. The host cell may be a eukaryotic or a 
prokaryotic host cell. Suitable examples are £ co//, insect cell lines such as 
Sf-9, chicken embryo fibroblast (CEF) cells, chicken embryo kidney (CEK) 

20 cells, African green monkey Vero cells and the like. 

Also part of this invention is an IBDV poultry vaccine comprising a 
poultry protecting amount of a recombinant^ produced virus or portion of a 
virus, wherein the virus is inactivated or modified such that it is no longer 
virulent. 

25 The virus can be inactivated by chemical or physical means. Chemical 

inactivation can be achieved by treating the virus with, for example, enzymes, 
formaldehyde, p-propiolactone, ethylene-imine or a derivative thereof, an 
organic solvent (e.g. halogenated hydrocarbon) and or a detergent. If 
necessary, the inactivating substance can be neutralized after the virus has 

30 been inactivated. Physical inactivation can be carried out by subjecting the 
viruses to radiation such as UV light, X-radiation, or v-radiation. 
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The virus can be attenuated by known methods including serial 
passage, deleting sequences of nucleic acids and site directed mutagenesis 
either before or after production of the infectious virus to produce a virus 
which retains sufficient antigenicity but which has reduced virulence. 
5 Physiologically acceptable carriers for vaccination of poultry are known 

in the art and need not be further described herein. In addition to being 
physiologically acceptable to the poultry the carrier must not interfere with the 
immunological response elicited by the vaccine and/or with the expression of 
its polypeptide product. 

10 Other additives, such as adjuvants and stabilizers, among others, may 

also be contained in the vaccine in amounts known in the art. Preferably, 
adjuvants such as aluminum hydroxide, aluminum phosphate, plant and 
animal oils, and the like, are administered with the vaccine in amounts 
sufficient to enhance the immune response to the IBDV. The amount of 

15 adjuvant added to the vaccine will vary depending on the nature of the 
adjuvant, generally ranging from about 0.1 to about 100 times the weight of 
the IBDV, preferably from about 1 to about 10 times the weight of the IBDV. 

The vaccine of the present invention may also contain various 
stabilizers. Any suitable stabilizer can be used including carbohydrates such 

20 as sorbitol, mannitol, starch, sucrose, dextrin, or glucose; proteins such as 
albumin or casein; and buffers such as alkaline metal phosphate and the like. 
A stabilizer is particularly advantageous when a dry vaccine preparation is 
prepared by lyophilization. 

The vaccine can be administered by any suitable known method of 

25 inoculating poultry including nasally, ophthalmically, by injection, in drinking 
water, in the feed, by exposure, and the like. Preferably, the vaccine is 
administered by mass administration techniques such as by placing the 
vaccine in drinking water or by spraying the animals 1 environment. When 
administered by injection, the vaccines are preferably administered 

30 parenterally. Parenteral administration as used herein means administration 
by intravenous, subcutaneous, intramuscular, or intraperitoneal injection. 



g 

The vaccine of the present invention is administered to poultry to 
prevent IBD anytime before or after hatching. Preferably, the vaccine is 
administered prior to the time of birth and after the animal is about 6 weeks 
of age. Poultry is defined to include but not be limited to chickens, roosters, 
5 hens, broilers, roasters, breeders, layers, turkeys and ducks. 

The vaccine may be provided in a sterile container in unit form or in 
other amounts. It is preferably stored frozen, below -20°C, and more 
preferably below -70°C. It is thawed prior to use, and may be refrozen 
immediately thereafter. For administration to poultry the recombinants 

10 produced virus may be suspended in a carrier in an amount of about 10 4 to 
1 0 7 pfu/ml, and more preferably about 10 s to 10 6 pfu/ml in a carrier such as 
a saline solution. The inactivated vaccine may contain the antigenic 
equivalent of 10 4 to 10 7 pfu/ml suspended in a carrier. Other carriers may 
also be utilized as is known in the art. Examples of pharmaceutical^ 

15 acceptable carriers are diluents and inert pharmaceutical carriers known in 
the art. Preferably, the carrier or diluent is one compatible with the 
administration of the vaccine by mass administration techniques. However, 
the carrier or diluent may also be compatible with other administration 
methods such as injection, eye drops, nose drops, and the like. 

20 The invention also can be used to produce combination vaccines with 

the IBD V material. The IBDV material can be combined with antigen material 
of Newcastle Disease Virus Infectious Bronchitis virus, Reo virus, Adeno virus 
and/or the Marek virus. 

The foregoing embodiments of the present invention are further 

25 described in the following Examples. However, the present invention is not 
limited by the Examples, and variations will be apparent to those skilled in the 
art without departing from the scope of the present invention. 
Brief Descript ion of the Drawings 

Figure 1 is a schematic diagram of cDNA constructs used for synthesis 

30 of plus-sense ssRNAs of IBDV with T7 RNA polymerase. Construct 
pUC19FLAD78 contains the cDNA of segment A of IBDV strain D78 and the 
recombinant plasmid pUC18FLA23 contains the full-length cDNA of segment 
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A of IBDV strain 23/82. Segment A of IBDV encodes the polyprotein (VP2- 
VP4-VP3), and the recently identified VP5 protein. Plasmid pUC18FLBP2 
contains the cDNA of segment B of strain P2 which encodes the RNA- 
dependent RNA polymerase (VP1). Virus specific sequences are underlined 
5 and the 17 promoter sequences are italicized. Restriction sites are shown in 
boldface and identified. The cleavage sites of the linearized plasmids are 
shown by vertical arrows and the transcription directions are marked by 
horizontal arrows. 

Figure 2 shows an agarose gel analysis of the transcription reaction 

10 products that were used for transfection of Vero cells. Synthetic RNAs 
transcribed in vitro using 17 RNA polymerase and linearized plasmids 
pUC1 9FLAD78 (lanes 2, 4 and 6) containing the cDNA of segment A of IBDV 
strain D78, and pUC18FLBP2 (lanes 1, 3 and 5) containing the cDNA of 
segment B of strain P2, respectively. After transcription, the reaction mixtures 

1 5 were either treated with DNase (lanes 1 and 2), RNase (lanes 3 and 4) or left 
untreated (lanes 5 and 6). Two pi of the reaction products were analyzed on 
1% agarose gel. Lambda DNA, digested with Hind \WEcoR I, was used as 
markers (lane M). 

Figure 3 shows a comparison of nucleotide sequences of cloned RT- 

20 PCR fragments from segments A and B of the chimeric IBDV strain 23A/P2B 
(bold-typed) with known sequences of segments A and B of serotype II strain 
23/82 and serotype I strain P2, respectively. Nucleotide identities are marked 
by a colon. 

Figure 4 shows the DNA sequence of pUC18FLA23. 
25 Figure 5 shows the DNA sequence of pUC1 9FLAD78. 

Figure 6 shows the DNA sequence of pUC1 8FLBP2. 

EXAMPLES 

Viruses and Cells. Two serotype I strains of IBDV, the attenuated P2 
strain from Germany and the vaccine strain D78 (Intervet International), and 
30 one serotype II strain, the apathogenic 23/82 strain, were propagated in 
chicken embryo cells (CEC) and purified (Mundt, E. et al., Virology, 209, 10- 
18 (1995); Vakharia, V.N., et al M Virus Res., 31, 265-273 (1994)). Vero cells 



were grown in M 199 medium supplemented with 5% fetal calf serum (FCS) 
and used for transfection experiments. Further propagation of the recovered 
virus and immunofluorescence studies were earned out in Vera cells (Mundt, 
E., et al., J. Gen. Virol., 76, 437-443, (1995)). For plaque assay, monolayers 
5 of secondary CEC were prepared and used (Muller, H., et al., Virus Res., 4, 
297-309 (1986)). 

Construction of Full-Length cDNA Clones of IBDV genome. Full- 
length cDNA clones of IBDV segments A and B were independently prepared. 
The cDNA clones containing the entire coding region of the RNA segment A 

10 of strain D78 were prepared using standard cloning procedures and methods 
(Vakharia, V.N., et al., Virus Res., 31, 265-273 (1994)). By comparing the 
D78 terminal sequences with recently published terminal sequences of other 
IBDV strains (Mundt, E. et al., Virology, 209, 10-18 (1995)), it was observed 
that D78 cDNA clones lacked the conserved first 17 and last 10 nucleotides 

15 at the 5 - and 3'-ends, respectively. Therefore, to construct a full-length cDNA 
clone of segment A, two primer pairs (A5'-D78, A5-IPD78 and A3'-IPD78) 
were synthesized and used for PCR amplification (Table 1). The DNA 
segments were amplified according to the protocol of the supplier (New 
England Biolabs) using "Deep Vent Polymerase" (high fidelity thermophilic 

20 DNA polymerase). Amplified fragments were cloned into the EcoR I site of a 
pCRII vector (Invitrogen Corp.) to obtain plasmids pCRD78A5' and 
pCRD78A3\ respectively. Each plasmid was digested with EcoR I and Sal 
I and the resultant fragments were ligated into EcoR I digested pUC19 to 
obtain plasmid pUC19FLAD78 (SEQ ID NOS:27 AND 29) which now contains 

25 a full-length cDNA copy of segment A encoding all the structural proteins 
(VP2, VP4 and VP3, SEQ ID NO:30) as well as the non-structural VP5 protein 
(SEQ ID NO:28) (Fig. 1). 

Two primer pairs (A5'-23, A5IP23 and A3-23, A3-IP23; see Table 1) 
were used for reverse transcription (RT) of viral genomic dsRNA of strain 

30 23/82 using "Superscript RT II" (RNA directed DNA polymerase with reduced 
RNase H activity, GIBCO/BRL). The RT reaction products were purified by 
phenol/chloroform extraction and ethanol precipitation. To obtain two cDNA 
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fragments bounded by primer pairs A5-23, A5-IP23 and A3-23, A3-IP23, 
respectively, RT reaction products were amplified by PCR using "Deep Vent 
polymerase". Both RT and PCR were carried out according to the supplier's 
protocol. Resulting PCR fragments were blunt-end ligated into Sma I cleaved 
5 pUC1 8 vector to obtain pUC23A5' and pUC23A3'. The 3'-end of segment A 
contained in plasmid pUC23A3' was ligated into the Hind \U-BstB I cleaved 
plasmid p(JC23A5' to establish the full-length cDNA of segment A of strain 
23/82. The resulting plasmid was termed pUC18FLA23 (SEQ ID NOS: 31 
AND 33)(Fig. 1) and encodes structural proteins VP2, VP3 and VP4 (SEQ ID 

10 NO: 32) and non-structural protein VP5 (SEQ ID NO: 34) 

To obtain cDNA clones of segment B of P2 strain, two primer pairs 
(B5-P2, B5-IPP2 and B3-P2, B3-IPP2) were designed according to the 
published sequences and used for RT-PCR amplification (see Table 1). 
Using genomic dsRNA as template, cDNA fragments were synthesized and 

1 5 amplified according to the supplier's protocol (Perkin-Elmer Cetus). Amplified 
fragments were blunt-end ligated into Sma I cleaved pBS vector (Stratagene) 
to obtain clones pBSP2B5' and pBSP2B3'. To construct a full-length clone 
of segment B, the 5'-end fragment of plasmid pBSP2B5' was first subcloned 
between EcoR I and Pst I sites of pUC18 vector to obtain pUCP2B5\ Then 

20 the 3'-end fragment of plasmid pBSP2B3' was inserted between the unique 
Bgl II and Pst I sites of plasmid pUCP2B5' to obtain a full-length plasmid 
pUC18FLBP2 (SEQ ID NO:25) which encodes the VP 1 protein (SEQ ID NO: 
26) (Fig. 1). Plasmids pUC18FLBP2, pUC18FLA23 and pUC19FLAD78 were 
completely sequenced by using the "Sequenase" DNA sequencing system 

25 (U.S. Biochem.), and the sequence data were analyzed using either 
"DNASIS" (Pharmacia) or "PC/Gene" (Intelligenetics) software. The integrity 
of the full-length constructs was tested by in vitro transcription and translation 
coupled reticulocyte lysate system using T7 RNA polymerase (Promega). 
Transcription and Transfection of Synthetic RNAs. Plasmids 

30 pUC1 9FLAD78, pUC1 8FLA23 and pUC1 8FLBP2 were digested with BsrG I, 
Nsi I and Pst I enzymes (see Fig. 1), respectively, and used as templates for 
in vitro transcription with T7 RNA polymerase (Promega). Briefly, restriction 
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enzyme cleavage assays were adjusted to 0.5% SDS and incubated with 
proteinase K (0.5 mg/ml) for 1 hour at 37°C. The linearized DNA templates 
(-3 ug) were recovered after ethanoi precipitation, and were added separately 
to a transcription reaction mixture (50 pi) containing 40 mM Tris-HCI (pH 7.9), 

5 1 0 mM NaCI, 6 mM MgCI 2 , 2 mM spermidine, 0.5 mM ATP, CTP and UTP 
each, 0.1 mM GTP, 0.25 mM cap analog [m7G(5') PPP(5*) G], 120 units of 
"RNasin" (ribonudease inhibitor), 150 units T7 RNA polymerase (Promega), 
and incubated at 37°C for 1 hour. Synthetic RNA transcripts were purified by 
phenol/chloroform extraction and ethanoi precipitation. As controls, the 

10 transcription products were treated with either DNase or RNase (Promega) 
before the purification step. 

Vero cells were grown to 80% confluence in 60 mm dishes and washed 
once with phosphate-buffered saline (PBS). Three ml of "OPTI-MEM I" 
(reduced serum medium containing HEPES buffer, sodium bicarbonate, 

15 hypoxanthine, thymidine, sodium pyruvate, L-glutamine, trace elements, 
growth factors and phenol red; from GIBCO/BRL) were added to the 
monolayers, and the cells were incubated at 37°C for 1 hour in a C0 2 
incubator. Simultaneously. 0.15 ml of "OPTI-MEM I" was incubated with 1.25 
ug of "Lipofectin" reagent (N-[1-(2,3-dioleyloxy)propyll-N,N,N- 

20 trimethylammonium chloride and dioleoylphosphatidylethanolamine, 
GIBCO/BRL) for 45 min. in a polystyrene tube at room temperature. Synthetic 
RNA transcripts of both segments, resuspended in 0.15 ml of diethyl 
pyrocarbonate-treated water, were added to the OPTI-MEM-Lipofectin- 
mixture, mixed gently, and incubated on ice for 5 min. After removing the 

25 "OPTI-MEM" from the monolayers in 60 mm dishes and replacing with fresh 
1 .5 ml of "OPTI-MEM", the nucleic acid containing mixture was added drop- 
wise to the Vero cells and swirled gently. After 2 hours of incubation at 37°C, 
the mixture was replaced with M199 medium [CaCI 2 (annhydrous), Fe(N0 3 ) 3 
9H 2 0, KCI, MgS0 4 (anhydrous), NaCI, NaH 2 P0 4 H 2 0, NaHC0 3 , L-Alanine, L- 

30 Arginine HCl, L-Aspartic acid, L-Cysteine HCI H 2 0, L-Cysteine 2HCI, L- 
Glutamic acid, L-Glutamine, Glycine, L-Histidine HCL H 2 0, L-Hydroxyproline, 
L-lsoleucine, L-Leucine, L-Lysine HCI, L-Methionine, L-Phenylalanine, L- 



PCTAJS97/12955 



14 

Proline, L-Serine, L-Threonine, L-Tryptophan, L-Tyrosine 2Na 2H 2 0, L-Valine, 
Alpha tocopherol P0 4 Na 2 , Ascorbic Acid, Biotin, Calciferol, D-Calcium 
pantothenate, Choline chloride, Folic acid, l-lnosltol, Menandione NaHS0 3 
3H 2 0, Niacin, Nicotinamide, Para-aminobenzoic acid, Pyridoxine HCI, 
5 Riboflavin, Thiamine HCI, Vitamin A Acetate, Adenine S0 4 , Adenylic Acid, 
ATP, Na 2 , Cholesterol, 2-Deoxy-D-Ribose, D-Glucose, Glutathione, Guanine 
HCI, Hypoxanthine Na, Phenol Red Na, Ribose, Sodium Acetate (anhydrous), 
Thymine, Tween 80, Uracil.and Xanthine Na; from Mediatech, Inc.] containing 
5% FCS (without rinsing cells) and the cells were further incubated at 37°C 

10 for desired time intervals. 

identification of Generated IBDV. CEC were infected with filtered 
(0.2 urn) supernatant from Vero cells transfected with transcripts of 
pUC18FLA23 and pUC18FLP2B. 16 hours post-infection, the whole cell 
nucleic acids were isolated (Mundt, E. et al., Virology, 209, 10-18 (1995)). 

1 5 Primers were designed according to the published sequences and RT-PCR 
fragments were amplified, cloned and sequenced (Mundt, E. et at., Virology, 
209, 10-18 (1995)). Sequence data were analyzed by using "DNASIS" 
software. 

20 Immunofluorescence. Vera cells, grown on cover slips to 80% 

confluence, were infected with the supematants derived from transfected 
Vero cells (after freeze-thawing) and incubated at 37°C for two days. The 
cells were then washed, fixed with acetone and treated with polyclonal rabbit 
anti-IBDV serum. After washing, the cells were treated with fluorescein 

25 labeled goat-anti-rabbit antibody (Kirkegaard & Perry Lab.) and examined by 
fluorescence microscope. 

Plaque Assay. Monolayers of secondary CEC, grown in 60 mm 
dishes, were inoculated with the supematants derived from transfected Vero 
cells. After 1 hour of infection, the cells were washed once with PBS and 

30 overlayed with 0.8% Agar noble (Difco) containing 10% tryptose phosphate 
broth, 2% FCS, 0.1 12% NaHC0 3 , 10 3 units penicillin, 10 3 ug/ml streptomycin, 
0.25 ug/ml fungizone, 0.005% neutral red, 0.0015% phenol red. The cells 
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were incubated at 37°C for 2 to 3 days until plaques could be observed and 
counted (MUller, H., et at., Virus Res., 4, 297-309 (1986)). 

Construction of Full-Length cDNA clones of IBDV Genome. To 
develop a reverse genetics system for the dsRNA virus IBDV, two 

5 independent cDNA clones were constructed that contain segment A of strain 
D78 and segment B of strain P2 (Fig. 1). Each plasmid encoded either the 
precursor of structural proteins (VP2, VP4, VP3) and VP5 or only VP1 protein 
(RNA-dependent RNA polymerase). Plasmid pUC18FLBP2 upon digestion 
with Pst I and transcription in vitro by T7 RNA polymerase, would yield RNA 

1 0 containing the correct 5 - and 3'-ends. Whereas, upon digestion with BsrG I 
and transcription, plasmid pUC19FLAD78 would yield RNA containing the 
correct 5'-end but with additional four nucleotides at the 3'end. Coupled 
transcription and translation of the above plasmids in a rabbit reticulocyte 
system yielded protein products that were correctly processed and comig rated 

15 with the marker IBDV proteins after fractionating on SDS-polyacrylamide gel 
and autoradiography (data not shown). 

Transcription, Transfection and Generation of Infectious Virus. 
Plus-sense transcripts of IBDV segment A and B were synthesized separately 

20 in vitro with T7 RNA polymerase using linearized full-length cDNA plasmids 
as templates (see Fig. 2). Although two species of RNA transcripts were 
observed for segment B on a neutral gel (lanes 1 and 5), fractionation of 
these samples on a denaturing gel yielded only one transcript-specific band 
(data not shown). In order to show that plus-sense RNA transcripts of both 

25 segments are needed for the generation of infectious virus, the transcription 
mixtures were incubated with different nucleases, as shown in Fig. 2. 
Synthetic RNAs recovered after treating the transcription products with DNase 
(lanes 1+2), RNase (lanes 3+4) or without treatment (lanes 5+6), were used 
for the transfection of Vera cells. As mock control, Lipofectin alone was used. 

30 Five days post-transfection, cytopathic effect (CPE) was only visible in Vera 
cells transfected with combined transcripts of untreated or DNase-treated 
transcription products, but not with RNase-treated transcription mixtures or 
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mock-transfected control. In addition, no CPE was detected when Vero cells 
were transfected with RNA of only segment A or B (data not shown). These 
results demonstrate that replication of IBDV ensued after transfection of Vero 
cells with plus-sense ssRNAs of both segments of IBDV. To verify that the 
5 agent causing the CPE in Vero cells was indeed IBDV, transfected Vero cells 
were freeze-thawed, and supernatants were clarified by centrifugation, and 
used to infect CEC or Vero cells. CEC infected with the supernatants derived 
from Vero transfected cells of untreated or DNase-treated transcription 
mixtures produced CPE in one day post-inoculation (Table 2). However, no 

10 CPE could be detected even after five days in CEC, with the supernatants 
from transfected Vero cells of RNase-treated transcription mixtures, untreated 
segment A or B transcription mixtures and mock-transfected Vero cells. 
Similarly, when Vero cells on cover slips were infected with the same 
supernatants as described above and examined by immunofluorescence 

1 5 staining after 2 days, only supernatants derived from transfected Vero cells 
of untreated or DNAse-treated transcription mixtures gave positive 
immunofluorescence signal (Table 2). 

Recovery of Transfectant Virus. To determine the time point for the 
recovery of infectious virus, Vero cells were transfected with combined RNA 

20 transcripts of segments A and B. At 4, 8, 16, 24, 36 and 48 hours 

transfection, the supernatants were examined for the presence of transfectant 
virus by infectivity and plaque assays, as shown in Table 3. Our results 
indicate that the virus could be recovered as early as 36 hours after 
transfection. Virus titer was 2.3 x 1 0 2 pfu/ml which appear to drop for samples 

25 obtained later than 48 hours after transfection. 



Generation of a Chimeric Virus. To prove that plus-sense ssRNA of 
both segments of IBDV are sufficient for recovery of infectious virus, a 
chimeric IBDV was generated. Plasmid pUC18FLA23 containing a full-length 
sequence of segment A of serotype II strain was linearized by Ate/ I digestion 
30 and ssRNA was synthesized in vitro using T7 RNA polymerase. The ssRNA 
transcript specifies the correct 5'-end but contains one additional residue at 
the 3'-end (Fig. 1 ). Vero cells were transfected with ssRNA of segment A of 
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serotype II strain 23/82 and ssRNA of segment B of serotype I strain P2. Five 
days after transfection when CPE was evident, the supernatant was clarified 
(after freeze-thawing) and used to infect CEC. After a second passage in 
CEC, genomic RNA of the virus was analyzed by RT-PCR and sequencing 
of the PCR products. Primers for segment A were deigned to specifically 
amplify only segment A sequences derived from the serotype II strain. Primer 
for segment B bound to sequences of both serotypes. The amplified 
fragments were cloned and sequenced. The obtained segment A sequences 
showed a perfect match with known segment A sequences of serotype II 
strain 23/82, whereas segment B sequence exhibited complete homology to 
published segment B sequences of serotype I strain P2 (Fig. 3). 
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Table 2. Generation of Infections IBDV From Synthetic RNAs of Segment A and B. 



_ ft 

Material Transfected 


CPc 


irnrnunoTiuoroescence 


SSRNA /V^D, UPlaatruieaKSU 


+ 


+ 


ssRNA A+B, RNase-treated 






... * 

ssRNA A+B, untreated 


+ 


+ 


ssRNA A, untreated 






ssRNA B, untreated 






Lipofectin only 







Vera cells were transfected with synthetic RNAs of segment A and B derived from 
transcription reactions that were either untreated or treated with DNase or RNase. After 
5 days the supernatants were collected, clarified by centrifugation, and analyzed for 
the presence of virus. The infectivity of the recovered virus was determined in CEC by 
the appearance of cytopathic effect (CPE) 1-2 days post-inoculation. The specificity of 
the recovered virus was determined by immunofluorescence staining of infected Vera 
cells with rabbit anti-IBDV serum. 
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Table 3. Recovery of Virus at Various Times Post-Transfection. 



Time in hours CPE Immunofluorescence pfu/ml 

post-transfection 



4 






0 


8 






0 


16 






0 


24 






0 


36 


+ 


+ 


2.3 * in 2 


48 


+ 


+ 


6.0 x 10 1 



Vero cells were transfected with synthetic RNAs of segment A and B as described; The 
infertility and specificity of the recovered virus was detected by CPE in CEC and 
immunofluorescence staining in Vero cells, respectively. Monolayers of secondary 
CEC were used for plaque assay after inoculating the cells with the supematants 
derived from transfected Vero cells. Approximate titer of the virus was calculated as 
plaque forming units per ml (pfu/ml). 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GAATTCGGCT TTAATACGAC TCACTATAGG ATACGATCGG TCTGAC 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
AATTGGATCC GTTCGCGGGT CCCCTGTACA AAGCCGAATT C 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CGGCGAATTC ATGCATAGGG GACCCGCGAA CGGATC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GTCAGACCGA TCGTATCCTA TAGTGAGTCG TATTAGAATT CTCT 



(2) INFORMATION FOR SEQ ID NO: 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: CDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TTGCATGCCT GCAGGGGGCC CCCGCAGGCG AAG 33 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCGTATCCTA TAGTGAGTCG TATTAGAATT C 31 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGAAGCCTGA GTGAGTTGAC TGACTACAGC TACAACGGGC TGATGTCAGC CACTGCGAAC 60 
ATCAACGACA AGATCGGGAA CGTTCTAGTT GGAGAAGGGG TGACTGTTCT CAGTCTACCG 120 



(2) INFORMATION FOR SEQ ID NO: 8: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY; linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGAAGCCTGA GTGAGTTGAC TGACTACAGC TACAACGGGC TGATGTCAGC CACTGCGAAC 60 
ATCAACGACA AGATCGGGAA CGTTCTAGTT GGAGAAGGGG TGACTGTTCT CAGTCTACC 119 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGAAGCCTGA GTGAACTGAC AGATGTTAGC TACAATGGGT TGATGTCTGC AACAGCCAAC 60 
ATCAACGACA AAATTGGGAA CGTCCTAGTA GGGGAAGGGG TCACCGTCCT CAGCTTACCC 120 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTTTCAATAG TCCACAGGCG CGAACGAAGA TCTCAGCAGC GTTCGGCATA AAGCCTACTG 60 
CTGGACAAGA CGTGGAAGAA CTCTTGATCC CCAAAGTCTG GGTGCCACCT GAGGATCCGC 120 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 : 
TTTTCAACAG TCCACAGGCG CGAAGCACGA TCTCAGCAGC GTTCGGCATA AAGCCTACTG 
CTGGACAAGA CGTGGAAGAA CTCTTGATCC CTAAAGTTTG GGTGCCACCT GAGGATCCGC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TTTTCAACAG TCCACAGGCG CGAAGCACGA TCTCAGCAGC GTTCGGCATA AAGCCTACTG 
CTGGACAAGA CGTGGAAGAA CTCTTGATCC CTAAAGTTTG GGTGCCACCT GAGGATCCGC 



(2) INFORMATION FOR SEQ ID NO: 13: 

■ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TAATACGACT CACTATAGGA TACGATCGGT CTGACCCCGG GGGAGTCA 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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AGAGAATTCT AATACGACTC ACTATAGGAT ACGATCGGTC TGAC 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TGTACAGGGG ACCCGCGAAC GGATCCAATT 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGCGAATTC ATGCATAGGG GACCCGCGAA CGGATC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGTCGACTAC GGGATTCTGG 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 



WO 98/09646 



27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CAGAGGCAGT ACTCCGTCTG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
AGTCGACGGG ATTCTTGCTT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GAAGGTGTGC GAGAGGAC 



( 2 ) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
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AGAGAATTCT AATACGACTC ACTATAGGAT ACGATGGGTC TGAC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

* * 

CGATCTGCTG CAGGGGGCCC CCGCAGGCGA AGG 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CTTGAGACTC TTGTTCTCTA CTCC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

« 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATACAGCAAA GATCTCGGG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2827 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY : circular 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 112.. 2745 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GGATACGATG GGTCTGACCC TCTGGGAGTC ACGAATTAAC GTGGCTACTA GGGGCGATAC 60 

* ■ • 

CCGCCGCTGG CCGCCACGTT AGTGGCTCCT CTTCTTGATG ATTCTGCCAC C ATG AGT 117 

Met Ser 
1 

GAC ATT TTC AAC AGT CCA CAG GCG CGA AGC ACG ATC TCA GCA GCG TTC 165 
Asp He Phe Asn Ser Pro Gin Ala Arg Ser Thr He Ser Ala Ala Phe 

5 10 15 



GGC ATA AAG CCT ACT GCT GGA CAA GAC GTG GAA GAA CTC TTG ATC CCT 213 
Gly He Lys Pro Thr Ala Gly Gin Asp Val Glu Glu Leu Leu He Pro 
20 25 30 

AAA GTT TGG GTG CCA CCT GAG GAT CCG CTT GCC AGC CCT AGT CGA CTG 261 

Lys Val Trp Val Pro Pro Glu Asp Pro Leu Ala Ser Pro Ser Arg Leu 
35 40 45 50 

GCA AAG TTC CTC AGA GAG AAC GGC TAC AAA GTT TTG CAG CCA CGG TCT 309 
Ala Lys Phe Leu Arg Glu Asn Gly Tyr Lys Val Leu Gin Pro Arg Ser 

55 60 65 

CTG CCC GAG AAT GAG GAG TAT GAG ACC GAC CAA ATA CTC CCA GAC TTA 357 

Leu Pro Glu Asn Glu Glu Tyr Glu Thr Asp Gin He Leu Pro Asp Leu 

70 75 80 

GCA TGG ATG CGA CAG ATA GAA GGG GCT GTT TTA AAA CCC ACT CTA TCT 405 

Ala Trp Met Arg Gin He Glu Gly Ala Val Leu Lys Pro Thr Leu Ser 

85 90 95 

CTC CCT ATT GGA GAT CAG GAG TAC TTC CCA AAG TAC TAC CCA ACA CAT 453 
Leu Pro He Gly Asp Gin Glu Tyr Phe Pro Lys Tyr Tyr Pro Thr His 
100 105 HO 

CGC CCT AGC AAG GAG AAG CCC AAT GCG TAC CCG CCA GAC ATC GCA CTA 501 
Arg Pro Ser Lys Glu Lys Pro Asn Ala Tyr Pro Pro Asp He Ala Leu 
115 120 125 130 
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CTC AAG CAG ATG ATT TAC CTG TTT CTC CAG GTT CCA GAG GCC AAC GAG 549 

Leu Lys Gin Met lie Tyr Leu Phe Leu Gin Val Pro Glu Ala Asn Glu 

135 140 145 

GGC CTA AAG GAT GAA GTA ACC CTC TTG ACC CAA AAC ATA AGG GAC AAG 597 
Gly Leu Lys Asp Glu Val Thr Leu Leu Thr Gin Asn He Arg Asp Lys 

150 155 160 

GCC TAT GGA AGT GGG ACC TAC ATG GGA CAA GCA AAT CGA CTT GTG GCC 645 

Ala Tyr Gly Ser Gly Thr Tyr Met Gly Gin Ala Asn Arg Leu Val Ala 

165 170 175 

ATG AAG GAG GTC GCC ACT GGA AGA AAC CCA AAC AAG GAT CCT CTA AAG 693 

Met Lys Glu Val Ala Thr Gly Arg Asn Pro Asn Lys Asp Pro Leu Lys 
180 185 190 

CTT GGG TAC ACT TTT GAG AGC ATC GCG CAG CTA CTT GAC ATC ACA CTA 741 

Leu Gly Tyr Thr Phe Glu Ser He Ala Gin Leu Leu Asp He Thr Leu 
195 200 205 210 

■ • 

CCG GTA GGC CCA CCC GGT GAG GAT GAC AAG CCC TGG GTG CCA CTC ACA 789 

Pro Val Gly Pro Pro Gly Glu Asp Asp Lys Pro Trp Val Pro Leu Thr 

215 220 225 

AGA GTG CCG TCA CGG ATG TTG GTG CTG ACG GGA GAC GTA GAT GGC GAC 837 

Arg Val Pro Ser Arg Met Leu Val Leu Thr Gly Asp Val Asp Gly Asp 

230 235 240 

TTT GAG GTT GAA GAT TAC CTT CCC AAA ATC AAC CTC AAG TCA TCA AGT 885 
Phe Glu Val Glu Asp Tyr Leu Pro Lys He Asn Leu Lys Ser Ser Ser 
245 250 



GGA CTA CCA TAT GTA GGT CGC ACC AAA GGA GAG ACA ATT GGC GAG ATG 933 

Gly Leu Pro Tyr Val Gly Arg Thr Lys Gly Glu Thr He Gly Glu Met 
260 265 270 

ATA GCT ATC TCA AAC CAG TTT CTC AGA GAG CTA TCA ACA CTG TTG AAG 981 
He Ala He Ser Asn Gin Phe Leu Arg Glu Leu Ser Thr Leu Leu Lys 
275 280 285 290 

CAA GGT GCA GGG ACA AAG GGG TCA AAC AAG AAG AAG CTA CTC AGC ATG 1029 
Gin Gly Ala Gly Thr Lys Gly Ser Asn Lys Lys Lys Leu Leu Ser Met 

295 300 305 

TTA AGT GAC TAT TGG TAC TTA TCA TGC GGG CTT TTG TTT CCA AAG GCT 1077 
Leu Ser Asp Tyr Trp Tyr Leu Ser Cys Gly Leu Leu Phe Pro Lys Ala 

310 315 320 

GAA AGG TAC GAC AAA AGT ACA TGG CTC ACC AAG ACC CGG AAC ATA TGG 1125 
Glu Arg Tyr Asp Lys Ser Thr Trp Leu Thr Lys Thr Arg Asn He Trp 
325 330 335 
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TCA GCT CCA TCC CCA ACA CAC CTC ATG ATC TCT ATG ATC ACC TGG CCC 1173 
Ser Ala Pro Ser Pro Thr His Leu Met He Ser Met He Thr Trp Pro 
340 345 350 

GTG ATG TCC AAC AGC CCA AAT AAC GTG TTG AAC ATT GAA GGG TGT CCA 1221 

Val Met Ser Asn Ser Pro Asn Asn Val Leu Asn He Glu Gly Cys Pro 

355 360 365 370 

i 

TCA CTC TAC AAA TTC AAC CCG TTC AGA GGA GGG TTG AAC AGG ATC GTC 1269 

Ser Leu Tyr Lys Phe Asn Pro Phe Arg Gly Gly Leu Asn Arg lie Val 

375 380 385 

GAG TGG ATA TTG GCC CCG GAA GAA CCC AAG GCT CTT GTA TAT GCG GAC 1317 
Glu Trp He Leu Ala Pro Glu Glu Pro Lys Ala Leu Val Tyr Ala Asp 

390 395 400 

AAC ATA TAC ATT GTC CAC TCA AAC ACG TGG TAC TCA ATT GAC CTA GAG 1365 
Asn lie Tyr He Val His Ser Asn Thr Trp Tyr Ser lie Asp Leu Glu 
405 410 415 

AAG GGT GAG GCA AAC TGC ACT CGC CAA CAC ATG CAA GCC GCA ATG TAC 1413 

Lys Gly Glu Ala Asn Cys Thr Arg Gin His Met Gin Ala Ala Met Tyr 

420 425 430 

TAC ATA CTC ACC AGA GGG TGG TCA GAC AAC GGC GAC CCA ATG TTC AAT 1461 
Tyr He Leu Thr Arg Gly Trp Ser Asp Asn Gly Asp Pro Met Phe Asn. 

440 445 450 



CAA ACA TGG GCC ACC TTT GCC ATG AAC ATT GCC CCT GCT CTA GTG GTG 1509 
Gin Thr Trp Ala Thr Phe Ala Met Asn He Ala Pro Ala Leu Val Val 

455 460 465 

GAC TCA TCG TGC CTG ATA ATG AAC CTG CAA ATT AAG ACC TAT GGT CAA 1557 

Asp Ser Ser Cys Leu He Met Asn Leu Gin He Lys Thr Tyr Gly Gin 

470 475 480 

GGC AGC GGG AAT GCA GCC ACG TTC ATC AAC AAC CAC CTC TTG AGC ACA 1605 

Gly Ser Gly Asn Ala Ala Thr Phe He Asn Asn His Leu Leu Ser Thr 

485 490 495 

CTA GTG CTT GAC CAG TGG AAC CTG ATG AGA CAG CCC AGA CCA GAC AGC 1653 
Leu Val Leu Asp Gin Trp Asn Leu Met Arg Gin Pro Arg Pro Asp Ser 
500 505 510 

GAG GAG TTC AAA TCA ATT GAG GAC AAG CTA GGT ATC AAC TTT AAG ATT 1701 

Glu Glu Phe Lys Ser He Glu Asp Lys Leu Gly He Asn Phe Lys He 
515 520 525 530 

GAG AGG TCC ATT GAT GAT ATC AGG GGC AAG CTG AGA CAG CTT GTC CTC 1749 

Glu Arg Ser He Asp Asp He Arg Gly Lys Leu Arg Gin Leu Val Leu 

535 540 545 
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CTT GCA CAA CCA GGG TAC CTG AGT GGG GGG GTT GAA CCA GAA CAA TCC 1797 
Leu Ala Gin Pro Gly Tyr Leu Ser Gly Gly Val Glu Pro Glu Gin Ser 

550 555 560 

AGC CCA ACT GTT GAG CTT GAC CTA CTA GGG TGG TCA GCT ACA TAC AGC 1845 
Ser Pro Thr Val Glu Leu Asp Leu Leu Gly Tip Ser Ala Thr Tyr Ser 

570 575 



AAA GAT CTC GGG ATC TAT GTG CCG GTG CTT GAC AAG GAA CGC CTA TTT 1893 
Lys Asp Leu Gly He Tyr Val Pro Val Leu Asp Lys Glu Arg Leu Phe 
580 585 590 

TGT TCT GCT GCG TAT CCC AAG GGA GTA GAG AAC AAG AGT CTC AAG TCC 1941 
Cys Ser Ala Ala Tyr Pro Lys Gly Val Glu Asn Lys Ser Leu Lys Ser 

600 605 610 



AAA GTC GGG ATC GAG CAG GCA TAC AAG GTA GTC AGG TAT GAG GCG TTG 1989 
Lys Val Gly He Glu Gin Ala Tyr Lys Val Val Arg Tyr Glu Ala Leu 

615 620 625 

AGG TTG GTA GGT GGT TGG AAC TAC CCA CTC CTG AAC AAA GCC TGC AAG 2037 
Arg Leu Val Gly Gly Trp Asn Tyr Pro Leu Leu Asn Lys Ala Cys Lys 

630 635 640 

AAT AAC GCA GGC GCC GCT CGG CGG CAT CTG GAG GCC AAG GGG TTC CCA 2085 
Asn Asn Ala Gly Ala Ala Arg Arg His Leu Glu Ala Lys Gly Phe Pro 
645 650 



CTC GAC GAG TTC CTA GCC GAG TGG TCT GAG CTG TCA GAG TTC GGT GAG 2133 
Leu Asp Glu Phe Leu Ala Glu Trp Ser Glu Leu Ser Glu Phe Gly Glu 
660 665 670 

GCC TTC GAA GGC TTC AAT ATC AAG CTG ACC GTA ACA TCT GAG AGC CTA 2181 
Ala Phe Glu Gly Phe Asn He Lys Leu Thr Val Thr Ser Glu Ser Leu 
675 680 685 690 

GCC GAA CTG AAC AAG CCA GTA CCC CCC AAG CCC CCA AAT GTC AAC AGA 2229 
Ala Glu Leu Asn Lys Pro Val Pro Pro Lys Pro Pro Asn Val Asn Arg 

695 700 705 

CCA GTC AAC ACT GGG GGA CTC AAG GCA GTC AGC AAC GCC CTC AAG ACC 2277 
Pro Val Asn Thr Gly Gly Leu Lys Ala Val Ser Asn Ala Leu Lys Thr 

710 715 720 

GGT CGG TAC AGG AAC GAA GCC GGA CTG AGT GGT CTC GTC CTT CTA GCC 2325 
Gly Arg Tyr Arg Asn Glu Ala Gly Leu Ser Gly Leu Val Leu Leu Ala 
725 730 735 



ACA GCA AGA AGC CGT CTG CAA GAT GCA GTT AAG GCC AAG GCA GAA GCC 
Thr Ala Arg Ser Arg Leu Gin Asp Ala Val Lys Ala Lys Ala Glu Ala 
740 745 750 



2373 
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GAG AAA CTC CAC AAG TCC AAG CCA GAC GAC CCC GAT GCA GAC TGG TTC 2421 

Glu Lys Leu His Lys Ser Lys Pro Asp Asp Pro Asp Ala Asp Trp Phe 

755 760 765 770 

GAA AGA TCA GAA ACT CTG TCA GAC CTT CTG GAG AAA GCC GAC ATC GCC 2469 

Glu Arg Ser Glu Thr Leu Ser Asp Leu Leu Glu Lys Ala Asp He Ala 

775 780 785 

AGC AAG GTC GCC CAC TCA GCA CTC GTG GAA ACA AGC GAC GCC CTT GAA 2517 

Ser Lys Val Ala His Ser Ala Leu Val Glu Thr Ser Asp Ala Leu Glu 

790 795 800 

GCA GTT CAG TCG ACT TCC GTG TAC ACC CCC AAG TAC CCA GAA GTC AAG 2565 
Ala Val Gin Ser Thr Ser Val Tyr Thr Pro Lys Tyr Pro Glu Val Lys 
805 810 815 

AAC CCA CAG ACC GCC TCC AAC CCC GTT GTT GGG CTC CAC CTG CCC GCC 2613 
Asn Pro Gin Thr Ala Ser Asn Pro Val Val Gly Leu His Leu Pro Ala 

820 825 830 

AAG AGA GCC ACC GGT GTC CAG GCC GCT CTT CTC GCA GCA GGA ACG AGC 2661 

Lys Arg Ala Thr Gly Val Gin Ala Ala Leu Leu Gly Ala Gly Thr Ser 

840 845 850 



AGA CCA ATG GGG ATG GAG GCC CCA ACA CGG TCC. AAG AAC GCC GTG AAA 2709 
Arg Pro Met Gly Met Glu Ala Pro Thr Arg Ser. Lys Asn Ala. Val Lys 

860 865 



ATG GCC AAA CGG CGG CAA CGC CAA AAG GAG AGC CGC TAACAGCCAT 2755 

Met Ala Lys Arg Arg Gin Arg Gin Lys Glu Ser Arg 

870 875 

GATGGGAACC ACTCAAGAAG AGGACACTAA TCCCAGACCC CGTATCCCCG GCCTTCGCCT 2815 
GCGGGGGCCC CC 2827 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 878 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Ser Asp He Phe Asn Ser Pro Gin Ala Arg Ser Thr He Ser Ala 
1 5 10 15 

Ala Phe Gly He Lys Pro Thr Ala Gly Gin Asp Val Glu Glu Leu Leu 
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lie Pro Lys Val 

35 

Arg Leu Ala Lys 
50 

Arg Ser Leu Pro 
65 

Asp Leu Ala Trp 



Leu Ser Leu Pro 

100 

Thr His Arg Pro 
115 

Ala Leu Leu Lys 
130 

Asn Glu Gly Leu 
145 

Asp Lys Ala Tyr 



Val Ala Met Lys 

180 

Leu Lys Leu Gly 

195 

Thr Leu Pro Val 
210 

Leu Thr Arg Val 
225 



Gly Asp Phe Glu 



Ser Ser Gly Leu 

260 

Glu Met He Ala 
275 

Leu Lys Gin Gly 



Trp Val Pro Pro 

40 

Phe Leu Arg Glu 

55 

Glu Asn Glu Glu 
70 

Met Arg Gin He 
65 

He Gly Asp Gin 



Ser Lys Glu Lys 

120 

Gin Met He Tyr 
135 

Lys Asp Glu Val 
150 

Gly Ser Gly Thr 
165 

Glu Val Ala Thr 



Tyr Thr Phe Glu 

200 

Gly Pro Pro Gly 
215 

Pro Ser Arg Met 
230 



Val Glu Asp Tyr 

245 

Pro Tyr Val Gly 



He Ser Asn Gin 

280 

Ala Gly Thr Lys 



34 

25 

Glu Asp Pro Leu 



Asn Gly Tyr Lys 

60 

Tyr Glu Thr Asp 

75 

Glu Gly Ala Val 
90 

Glu Tyr Phe Pro 
105 

Pro Asn Ala Tyr 



Leu Phe Leu Gin 

140 

Thr Leu Leu Thr 
155 

Tyr Met Gly Gin 
170 

Gly Arg Asn Pro 
185 

Ser He Ala Gin 



Glu Asp Asp Lys 

220 

Leu Val Leu Thr 
235 



Leu Pro Lys He 

250 

Arg Thr Lys Gly 
265 

Phe Leu Arg Glu 



Gly Ser Asn Lys 



30 

Ala Ser Pro Ser 
45 

Val Leu Gin Pro 



Gin He Leu Pro 

80 

Leu Lys Pro Thr 

95 

Lys Tyr Tyr Pro 
110 

Pro Pro Asp He 
125 

Val Pro Glu Ala 



Gin Asn He Arg 

160 

Ala Asn Arg Leu 
175 

Ash Lys Asp Pro 
190 

Leu Leu Asp He 
205 

Pro Trp Val Pro 



Gly Asp Val Asp 

240 



Asn Leu Lys Ser 
255 

Glu Thr He Gly 
270 

Leu Ser Thr Leu 
285 

Lys Lys Leu Leu 
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290 295 300 

Ser Met Leu Ser Asp Tyr Trp Tyr Leu Ser Cys Gly Leu Leu Phe Pro 

305 310 315 320 

Lys Ala Glu Arg Tyr Asp Lys Ser Thr Trp Leu Thr Lys Thr Arg Asn 

325 330 335 

lie Trp Ser Ala Pro Ser Pro Thr His Leu Met lie Ser Met lie Thr 

340 345 350 

Trp Pro Val Met Ser Asn Ser Pro Asn Asn Val Leu Asn lie Glu Gly 
355 360 



Cys Pro Ser Leu Tyr Lys Phe Asn Pro Phe Arg Gly Gly Leu Asn Arg 
370 375 380 

He Val Glu Trp He Leu Ala Pro Glu Glu Pro Lys Ala Leu Val Tyr 
385 390 395 400 

Ala Asp Asn He Tyr He Val His Ser Asn Thr Trp Tyr Ser He Asp 

405 410 415 

Leu Glu Lys Gly Glu Ala Asn Cys Thr Arg Gin His Met Gin Ala Ala 

420 425 430 

Met Tyr Tyr He Leu Thr Arg Gly Trp Ser Asp Asn Gly Asp Pro Met 
435 440 445 

Phe Asn Gin Thr Trp Ala Thr Phe. Ala Met Asn He Ala Pro Ala Leu 
450 455 460 

Val Val Asp Ser Ser Cys Leu lie Met Asn Leu Gin He Lys Thr Tyr 
465 470 475 480 

Gly Gin Gly Ser Gly Asn Ala Ala Thr Phe He Asn Asn His Leu Leu 

485 490 495 

Ser Thr Leu Val Leu Asp Gin Trp Asn Leu Met Arg Gin Pro Arg Pro 

500 505 510 

Asp Ser Glu Glu Phe Lys Ser He Glu Asp Lys Leu Gly He Asn Phe 
515 520 



■ • 



Lys lie Glu Arg 
530 

Val Leu Leu Ala 

545 

Gin Ser Ser Pro 



Ser He Asp Asp 
535 

Gin Pro Gly Tyr 
550 

Thr Val Glu Leu 



He Arg Gly Lys 

540 

Leu Ser Gly Gly 
555 

Asp Leu Leu Gly 



Leu Arg Gin Leu 

Val Glu Pro Glu 

560 

Trp Ser Ala Thr 
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565 570 575 

Tyr Ser Lys Asp Leu Gly lie Tyr Val Pro Val Leu Asp Lys Glu Arg 

580 585 590 

Leu Phe Cys Ser Ala Ala Tyr Pro Lys Gly Val Glu Asn Lys Ser Leu 

595 600 605 

Lys Ser Lys Val Gly He Glu Gin Ala Tyr Lys Val Val Arg Tyr Glu 
610 615 620 

Ala Leu Arg Leu Val Gly Gly Trp Asn Tyr Pro Leu Leu Asn Lys Ala 
625 630 635 640 

Cys Lys Asn Asn Ala Gly Ala Ala Arg Arg His Leu Glu Ala Lys iGly 

645 650 655 

Phe Pro Leu Asp Glu Phe Leu Ala Glu Trp Ser Glu Leu Ser Glu Phe 

660 665 670 

Gly Glu Ala Phe Glu Gly Phe Asn He Lys Leu Thr Val Thr Ser Glu 

675 680 685 

Ser Leu Ala Glu Leu Asn Lys Pro Val Pro Pro Lys Pro Pro Asn Val 
690 695 700 

Asn Arg Pro Val Asn Thr Gly Gly Leu Lys Ala Val Ser Asn Ala Leu 
705 710 715 720 

Lys Thr Gly Arg Tyr Arg Asn Glu Ala Gly Leu Ser Gly Leu Val Leu 

725 730 735 



Leu Ala Thr Ala Arg Ser Arg Leu 

740 

Glu Ala Glu Lys Leu His Lys Ser 

755 760 



Gin Asp Ala Val Lys Ala Lys Ala 
745 750 

Lys Pro Asp Asp Pro Asp Ala Asp 

765 



Trp Phe Glu Arg Ser Glu Thr Leu Ser Asp Leu Leu Glu Lys Ala Asp 
770 775 780 

He Ala Ser Lys Val Ala His Ser Ala Leu Val Glu Thr Ser Asp Ala 
785 790 795 800 

Leu Glu Ala Val Gin Ser Thr Ser Val Tyr Thr Pro Lys Tyr Pro Glu 

805 810 815 



Val Lys Asn Pro Gin Thr Ala Ser Asn Pro Val Val Gly Leu His Leu 

820 825 630 

Pro Ala Lys Arg Ala Thr Gly Val Gin Ala Ala Leu Leu Gly Ala Gly 
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835 840 845 

Thr Ser Arg Pro Met Gly Met Glu Ala Pro Thr Arg Ser Lys Asn Ala 
850 855 860 

Val Lys Met Ala Lys Arg Arg Gin Arg Gin Lys Glu Ser Arg 
865 870 875 

(2) INFORMATION FOR SEQ ID NO: 27: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 97.. 531 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



GGATACGATC GGTCTGACCC CGGGGGAGTC ACCCGGGGAC AGGCCGTCAA GGCCTTGTTC 

CAGGATGGGA CTCCTCCTTC TACAACGCTA TCATTG ATG GTT AGT AGA GAT CAG 

Met Val Ser Arg Asp Gin 
880 

ACA AAC GAT CGC AGC GAT GAC AAA CCT GCA AGA TCA AAC CCA ACA GAT 

Thr Asn Asp Arg Ser Asp Asp Lys Pro Ala Arg Ser Asn Pro Thr Asp 

885 890 895 900 

TGT TCC GTT CAT ACG GAG CCT TCT GAT GCC AAC AAC CGG ACC GGC GTC 

Cys Ser Val His Thr Glu Pro Ser Asp Ala Asn Asn Arg Thr Gly Val 

905 910 915. 

CAT TCC GGA CGA CAC CCT GGA GAA GCA CAC TCT CAG GTC AGA GAC CTC 

His Ser Gly Arg His Pro Gly Glu Ala His Ser Gin Val Arg Asp Leu 

920 925 930 

GAC CTA CAA TTT GAC TGT GGG GGA CAC AGG GTC AGG GCT AAT TGT CTT 
Asp Leu Gin Phe Asp Cys Gly Gly His Arg Val Arg Ala Asn Cys Leu 

935 940 945 

TTT CCC TGG ATT CCC TGG CTC AAT TGT GGG TGC TCA CTA CAC ACT GCA 
Phe Pro Trp He Pro Trp Leu Asn Cys Gly Cys Ser Leu His Thr Ala 
950 955 960 
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GGG CAA TGG GAA CTA CAA GTT CGA TCA GAT GCT CCT GAC TGC CCA GAA 402 
Gly Gin Tip Glu Leu Gin Val Arg Ser Asp Ala Pro Asp Cys Pro Glu 
965 970 975 980 

CCT ACC GGC CAG TTA CAA CTA CTG CAG GCT AGT GAG TCG GAG TCT CAC 450 
Pro Thr Gly Gin Leu Gin Leu Leu Gin Ala Ser Glu Ser Glu Ser His 

985 990 995 

r 

AGT GAG GTC AAG CAC ACT TCC TGG TGG CGT TTA TGC ACT AAA CGG CAC 498 
Ser Glu Val Lys His Thr Ser Trp Trp Arg Leu Cys Thr Lys Arg His 

1000 1005 1010 

CAT AAA CGC CGT GAC CTT CCA AGG AAG CCT GAG TGAACTGACA GATGTTAGCT 551 
His Lys Arg Arg Asp Leu Pro Arg Lys Pro Glu 

1015 1020 

ACAATGGGTT GATGTCTGCA ACAGCCAACA TCAACGACAA AATTGGGAAC GTCCTAGTAG 611 
GGGAAGGGGT CACCGTCCTC AGCTTACCCA CATCATATGA TCTTGGGTAT GTGAGGCTTG 671 

0 

GTGACCCCAT TCCCGCAATA GGGCTTGACC CAAAAATGGT AGCCACATGT GACAGCAGTG 731 

ACAGGCCCAG AGTCTACACC ATAACTGCAG CCGATGATTA CCAATTCTCA TCACAGTACC 791 

AACCAGGTGG GGTAACAATC ACACTGTTCT CAGCCAACAT TGATGCCATC ACAAGCCTCA 851 

GCGTTGGGGG AGAGCTCGTG TTTCAAACAA GCGTCCACGG CCTTGTACTG GGCGCCACCA 911 

TCTACCTCAT AGGCTTTGAT GGGACAACGG TAATCACCAG GGCTGTGGCC GCAAACAATG 971 

GGCTGACGAC CGGCACCGAC AACCTTATGC CATTCAATCT TGTGATTCCA ACAAACGAGA 1031 

TAACCCAGCC AATCACATCC ATCAAACTGG AGATAGTGAC CTCCAAAAGT GGTGGTCAGG 1091 



* - • 



CAGGGGATCA GATGTCATGG TCGGCAAGAG GGAGCCTAGC AGTGACGATC CATGGTGGCA 1151 

ACTATCCAGG GGCCCTCCGT CCCGTCACGC TAGTGGCCTA CGAAAGAGTG GCAACAGGAT 1211 

CCGTCGTTAC GGTCGCTGGG GTGAGCAACT TCGAGCTGAT CCCAAATCCT GAACTAGCAA 1271 

AGAACCTGGT TACAGAATAC GGCCGATTTG ACCCAGGAGC CATGAACTAC ACAAAATTGA 1331 

TACTGAGTGA GAGGGACCGT CTTGGCATCA AGACCGTCTG GCCAACAAGG GAGTACACTG 1391 

ACTTTCGTGA ATACTTCATG GAGGTGGCCG ACCTCAACTC TCCCCTGAAG ATTGCAGGAG 1451 

CATTCGGCTT CAAAGACATA ATCCGGGCCA TAAGGAGGAT AGCTGTGCCG GTGGTCTCCA 1511 

CATTGTTCCC ACCTGCCGCT CCCCTAGCCC ATGCAATTGG GGAAGGTGTA GACTACCTGC 1571 

TGGGCGATGA GGCACAGGCT GCTTCAGGAA CTGCTCGAGC CGCGTCAGGA AAAGCAAGAG 1631 
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CTGCCTCAGG CCGCATAAGG CAGCTGACTC TCGCCGCCGA CAAGGGGTAC GAGGTAGTCG 1691 

CGAATCTATT CCAGGTGCCC CAGAATCCCG TAGTCGACGG GATTCTTGCT TCACCTGGGG 1751 

TACTCCGCGG TGCACACAAC CTCGACTGCG TGTTAAGAGA GGGTGCCACG CTATTCCCTG 1811 

TGGTTATTAC GACAGTGGAA GACGCCATGA CACCCAAAGC ATTGAACAGC AAAATGTTTG 1871 

* 

■ 

CTGTCATTGA AGGCGTGCGA GAAGACCTCC AACCTCCATC TCAAAGAGGA TCCTTCATAC 1931 

GAACTCTCTC TGGACACAGA GTCTATGGAT ATGCTCCAGA TGGGGTACTT CCACTGGAGA 1991 

CTGGGAGAGA CTACACCGTT GTCCCAATAG ATGATGTCTG GGACGACAGC ATTATGCTGT 2051 

CCAAAGATCC CATACCTCCT ATTGTGGGAA ACAGTGGAAA TCTAGCCATA GCTTACATGG 2111 

ATGTGTTTCG ACCCAAAGTC CCAATCCATG TGGGTATGAC GGGAGCCCTC AATGCTTGTG 2171 

GCGAGATTGA GAAAGTAAGC TTTAGAAGCA CCAAGCTCGC CACTGCACAC CGACTTGGCC 2231 

TTAGGTTGGC TGGTCCCGGA GCATTCGATG TAAACACCGG GCCCAACTGG GCAACGTTCA 2291 

TCAAACGTTT CCCTCACAAT CCACGCGACT GGGACAGGCT CCCCTACCTC AACCTACCAT 2351 

ACCTTCCACC CAATGCAGGA CGCCAGTACC ACCTTGCCAT GGCTGCATCA GAGTTCAAAG 2411 

AGACCCCCGA ACTCGAGAGT GCCGTCAGAG CAATGGAAGC AGCAGCCAAC GTGGACCCAC 2471 

TATTCCAATC TGCACTCAGT GTGTTCATGT GGCTGGAAGA GAATGGGATT GTGACTGACA 2531 

TGGCCAACTT CGCACTCAGC GACCCGAACG CCCATCGGAT GCGAAATTTT CTTGCAAACG 2591 

CACCACAAGC AGGCAGCAAG TCGCAAAGGG CCAAGTACGG GACAGCAGGC TACGGAGTGG 2651 

i 

AGGCTCGGGG CCCCACACCA GAGGAAGCAC AGAGGGAAAA AGACACACGG ATCTCAAAGA 2711 

AGATGGAGAC CATGGGCATC TACTTTGCAA CACCAGAATG GGTAGCACTC AATGGGCACC 2771 

GAGGGCCAAG CCCCGGCCAG CTAAAGTACT GGCAGAACAC ACGAGAAATA CCGGACCCAA 2831 

ACGAGGACTA TCTAGACTAC GTGCATGCAG AGAAGAGCCG GTTGGCATCA GAAGAACAAA 2891 

TCCTAAGGGC AGCTACGTCG ATCTACGGGG CTCCAGGACA GGCAGAGCCA CCCCAAGCTT 2951 

TCATAGACGA AGTTGCCAAA GTCTATGAAA TCAACCATGG ACGTGGCCCA AACCAAGAAC 3011 

AGATGAAAGA TCTGCTCTTG ACTGCGATGG AGATGAAGCA TCGCAATCCC AGGCGGGCTC 3071 

TACCAAAGCC CAAGCCAAAA CCCAATGCTC CAACACAGAG ACCCCCTGGT CGGCTGGGCC 3131 

GCTGGATCAG GACCGTCTCT GATGAGGACC TTGAGTGAGG CTCCTGGGAG TCTCCCGACA 3191 
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CCACCCGCGC AGGTGTGGAC ACCAATTCGG CCTTACAACA TCCCAAATTG GATCCGTTCG 3251 
CGGGTCCCCT 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Val Ser Arg Asp Gin Thr Asn Asp Arg Ser Asp Asp Lys Pro Ala 
15 10 15 

Arg Ser Asn Pro Thr Asp Cys Ser Val His Thr Glu Pro Ser Asp Ala 

20 25 30 

Asn Asn Arg Thr Gly Val His Ser Gly Arg His Pro Gly Glu Ala His 

35 40 45 

Ser Gin Val Arg Asp Leu Asp Leu Gin Phe Asp Cys Gly Gly His Arg 
50 55 60 

Val Arg Ala Asn Cys Leu Phe Pro Trp lie Pro Trp Leu Asn Cys Gly 
65 70 75 80 

Cys Ser Leu His Thr Ala Gly Gin Trp Glu Leu Gin Val Arg Ser Asp 

85 90 95 

Ala Pro Asp Cys Pro Glu Pro Thr Gly Gin Leu Gin Leu Leu Gin Ala 

100 105 110 

Ser Glu Ser Glu Ser His Ser Glu Val Lys His Thr Ser Trp Trp Arg 
115 120 125 

Leu Cys Thr Lys Arg His His Lys Arg Arg Asp Leu Pro Arg Lys Pro 
130 135 140 

Glu 
145 



3261 



* * 



(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS 
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(A) LENGTH: 3261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 131. • 3166 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GGATACGATC GGTCTGACCC CGGGGGAGTC ACCCGGGGAC AGGCCGTCAA GGCCTTGTTC 60 

CAGGATGGGA CTCCTCCTTC TACAACGCTA TCATTGATGG TTAGTAGAGA TCAGACAAAC 120 

GATCGCAGCG ATG ACA AAC CTG CAA GAT CAA ACC CAA CAG ATT GTT CCG 169 

Met Thr Asn Leu Gin Asp Gin Thr Gin Gin lie Val Pro 

150 155 



TTC ATA CGG AGC CTT CTG ATG CCA ACA ACC GGA CCG GCG TCC ATT CCG 217 
Phe He Arg Ser Leu Leu Met Pro Thr Thr Gly Pro Ala Ser He Pro 
160 165 170 

GAC GAC ACC CTG GAG AAG CAC ACT CTC AGG TCA GAG ACC TCG ACC TAC 265 

Asp Asp Thr Leu Glu Lys His Thr Leu Arg Ser Glu Thr Ser Thr Tyr 

175 180 185 190 

AAT TTG ACT GTG GGG GAC ACA GGG TCA GGG CTA ATT GTC TTT TTC CCT 313 
Asn Leu Thr Val Gly Asp Thr Gly Ser Gly Leu He Val Phe Phe Pro 

195 200 205 

GGA TTC CCT GGC TCA ATT GTG GGT GCT CAC TAC ACA CTG CAG GGC AAT 361 
Gly Phe Pro Gly Ser He Val Gly Ala His Tyr Thr Leu Gin Gly Asn 

210 215 220 

GGG AAC TAC AAG TTC GAT CAG ATG CTC CTG ACT GCC CAG AAC CTA CCG 409 

Gly Asn Tyr Lys Phe Asp Gin Met Leu Leu Thr Ala Gin Asn Leu Pro 
225 230 235 



GCC AGT TAC AAC TAC TGC AGG CTA GTG AGT CGG AGT CTC ACA GTG AGG 
Ala Ser Tyr Asn Tyr Cys Arg Leu Val Ser Arg Ser Leu Thr Val Arg 
240 245 250 



457 



TCA AGC ACA CTT CCT GGT GGC GTT TAT GCA CTA AAC GGC ACC ATA AAC 

Ser Ser Thr Leu Pro Gly Gly Val Tyr Ala Leu Asn Gly Thr He Asn 

260 265 270 



505 



GCC GTG ACC TTC CAA GGA AGC CTG AGT GAA CTG ACA GAT GTT AGC TAC 



553 
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Ala Val Thr Phe Gin Gly Ser Leu Ser Glu Leu Thr Asp Val Ser Tyr 

275 280 285 

AAT GGG TTG ATG TCT GCA ACA GCC AAC ATC AAC GAC AAA ATT GGG AAC 601 
Asn Gly Leu Met Ser Ala Thr Ala Asn lie Asn Asp Lys lie Gly Asn 

290 295 300 

GTC CTA GTA GGG GAA GGG GTC ACC GTC CTC AGC TTA CCC ACA TCA TAT 649 
Val Leu Val Gly Glu Gly Val Thr Val Leu Ser Leu Pro Thr Ser Tyr 
305 310 315 



GAT CTT GGG TAT GTG AGG CTT GGT GAC CCC ATT CCC GCA ATA GGG CTT 697 
Asp Leu Gly Tyr Val Arg Leu Gly Asp Pro lie Pro Ala lie Gly Leu 
320 325 330 

GAC CCA AAA ATG GTA GCC ACA TGT GAC AGC AGT GAC AGG CCC AGA GTC 745 
Asp Pro Lys Met Val Ala Thr Cys Asp Ser Ser Asp Arg Pro Arg Val 

340 345 350 



TAC ACC ATA ACT GCA GCC GAT GAT TAC CAA TTC TCA TCA CAG TAC CAA 793 
Tyr Thr lie Thr Ala Ala Asp Asp Tyr Gin Phe Ser Ser Gin Tyr Gin 

360 365 



CCA GGT GGG GTA ACA ATC ACA CTG TTC TCA GCC AAC ATT GAT GCC ATC 841 
Pro Gly Gly Val Thr lie Thr Leu Phe Ser Ala Asn lie Asp Ala lie 

370 375 380 

ACA AGC CTC AGC GTT GGG GGA GAG CTC GTG TTT CAA ACA AGC GTC CAC 889 
Thr Ser Leu Ser Val Gly Gly Glu Leu Val Phe Gin Thr Ser Val His 
385 390 395 

GGC CTT GTA CTG GGC GCC ACC ATC TAC CTC ATA GGC TTT GAT GGG ACA 937 
Gly Leu Val Leu Gly Ala Thr lie Tyr Leu lie Gly Phe Asp Gly Thr 
400 405 410 

ACG GTA ATC ACC AGG GCT GTG GCC GCA AAC AAT GGG CTG ACG ACC GGC 985 
Thr Val lie Thr Arg Ala Val Ala Ala Asn Asn Gly Leu Thr Thr Gly 
415 420 425 430 

ACC GAC AAC CTT ATG CCA TTC AAT CTT GTG ATT CCA ACA AAC GAG ATA 1033 
Thr Asp Asn Leu Met Pro Phe Asn Leu Val lie Pro Thr Asn Glu lie 

435 440 445 

ACC CAG CCA ATC ACA TCC ATC AAA CTG GAG ATA GTG ACC TCC AAA AGT 1081 
Thr Gin Pro lie Thr Ser lie Lys Leu Glu lie Val Thr Ser Lys Ser 

450 455 460 



GGT GGT CAG GCA GGG GAT CAG ATG TCA TGG TCG GCA AGA GGG AGC CTA 
Gly Gly Gin Ala Gly Asp Gin Met Ser Trp Ser Ala Arg Gly Ser Leu 
465 470 475 



1129 
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GCA GTG ACG ATC CAT GGT GGC AAC TAT CCA GGG GCC CTC CGT CCC GTC 1177 
Ala Val Thr He His Gly Gly Asn Tyr Pro Gly Ala Leu Arg Pro Val 
480 485 490 

ACG CTA GTG GCC TAC GAA AGA GTG GCA ACA GGA TCC GTC GTT ACG GTC 1225 
Thr Leu Val Ala Tyr Glu Arg Val Ala Thr Gly Ser Val Val Thr Val 
495 500 505 510 

GCT GGG GTG AGC AAC TTC GAG CTG ATC CCA AAT CCT GAA CTA GCA AAG 1273 
Ala Gly Val Ser Asn Phe Glu Leu He Pro Asn Pro Glu Leu Ala Lys 

515 520 



AAC CTG GTT ACA GAA TAC GGC CGA TTT GAC CCA GGA GCC ATG AAC TAC 1321 
Asn Leu Val Thr Glu Tyr Gly Arg Phe Asp Pro Gly Ala Met Asn Tyr 

530 535 540 

ACA AAA TTG ATA CTG AGT GAG AGG GAC CGT CTT GGC ATC AAG ACC GTC 1369 
Thr Lys Leu He Leu Ser Glu Arg Asp Arg Leu Gly He Lys Thr Val 
545 550 



TGG CCA ACA AGG GAG TAC ACT GAC TTT CGT GAA TAC TTC ATG GAG GTG 1417 
Trp Pro Thr Arg Glu Tyr Thr Asp Phe Arg Glu Tyr Phe Met Glu Val 
560 565 570 

GCC GAC CTC AAC TCT CCC CTG AAG ATT GCA GGA GCA TTC GGC TTC AAA 1465 
Ala Asp Leu Asn Ser Pro Leu Lys He Ala Gly Ala Phe Gly Phe Lys 
575 580 585 590 

GAC ATA ATC CGG GCC ATA AGG AGG ATA GCT GTG CCG GTG GTC TCC ACA 1513 
Asp He He Arg Ala He Arg Arg He Ala Val Pro Val Val Ser Thr 

595 600 60S 

TTG TTC CCA CCT GCC GCT CCC CTA GCC CAT GCA ATT GGG GAA GGT GTA 1561 
Leu Phe Pro Pro Ala Ala Pro Leu Ala His Ala He Gly Glu Gly Val 

610 615 620 

GAC TAC CTG CTG GGC GAT GAG GCA CAG GCT GCT TCA GGA ACT GCT CGA 1609 
Asp Tyr Leu Leu Gly Asp Glu Ala Gin Ala Ala Ser Gly Thr Ala Arg 

625 630 635 

GCC GCG TCA GGA AAA GCA AGA GCT GCC TCA GGC CGC ATA AGG CAG CTG 1657 

Ala Ala Ser Gly Lys Ala Arg Ala Ala Ser Gly Arg He Arg Gin Leu 

640 645 650 

ACT CTC GCC GCC GAC AAG GGG TAC GAG GTA GTC GCG AAT CTA TTC CAG 1705 
Thr Leu Ala Ala Asp Lys Gly Tyr Glu Val Val Ala Asn Leu Phe Gin 

660 665 670 



GTG CCC CAG AAT CCC GTA GTC GAC GGG ATT CTT GCT TCA CCT GGG GTA 1753 
Val Pro Gin Asn Pro Val Val Asp Gly He Leu Ala Ser. Pro Gly Val 

675 680 685 
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CTC CGC GGT GCA CAC AAC CTC GAC TGC GTG TTA AGA GAG GGT GCC ACG 1801 

Leu Arg Gly Ala His Asn Leu Asp Cys Val Leu Arg Glu Gly Ala Thr 

690 695 700 

CTA TTC CCT GTG GTT ATT ACG ACA GTG GAA GAC GCC ATG ACA CCC AAA 184 9 
Leu Phe Pro Val Val He Thr Thr Val Glu Asp Ala Met Thr Pro Lys 
705 710 715 

GCA TTG AAC AGC AAA ATG TTT GCT GTC ATT GAA GGC GTG CGA GAA GAC 1897 
Ala Leu Asn Ser Lys Met Phe Ala Val He Glu Gly Val Arg Glu Asp 
720 725 730 

CTC CAA CCT CCA TCT CAA AGA GGA TCC TTC ATA CGA ACT CTC TCT GGA 1945 
Leu Gin Pro Pro Ser Gin Arg Gly Ser Phe He Arg Thr Leu Ser Gly 
735 740 745 750 

CAC AGA GTC TAT GGA TAT GCT CCA GAT GGG GTA CTT CCA CTG GAG ACT 1993 
His Arg Val Tyr Gly Tyr Ala Pro Asp Gly Val Leu Pro Leu Glu Thr 

755 760 765 

GGG AGA GAC TAC ACC GTT GTC CCA ATA GAT GAT GTC TGG GAC GAC AGC 2041 
Gly Arg Asp Tyr Thr Val Val Pro He Asp Asp Val Trp Asp Asp Ser 

770 775 780 

ATT ATG CTG TCC AAA GAT CCC ATA CCT CCT ATT GTG GGA AAC AGT GGA 2089 
He Met Leu Ser Lys Asp Pro He Pro Pro lie Val Gly Ash Ser Gly 
785 790 795 

AAT CTA GCC ATA GCT TAC ATG GAT GTG TTT CGA CCC AAA GTC CCA ATC 2137 
Asn Leu Ala He Ala Tyr Met Asp Val Phe Arg Pro Lys Val Pro He 
800 805 810 * 

CAT GTG GCT ATG ACG GGA GCC CTC AAT GCT TGT GGC GAG ATT GAG AAA 2185 
His Val Ala Met Thr Gly Ala Leu Asn Ala Cys Gly Glu lie Glu Lys 
815 820 825 830 

GTA AGC TTT AGA AGC ACC AAG CTC GCC ACT GCA CAC CGA CTT GGC CTT 2233 

Val Ser Phe Arg Ser Thr Lys Leu Ala Thr Ala His Arg Leu Gly Leu 

835 840 845 

AGG TTG GCT GGT CCC GGA GCA TTC GAT GTA AAC ACC GGG CCC AAC TGG 2281 
Arg Leu Ala Gly Pro Gly Ala Phe Asp Val Asn Thr Gly Pro Asn Trp 

850 855 860 

GCA ACG TTC ATC AAA CGT TTC CCT CAC AAT CCA CGC GAC TGG GAC AGG 2329 
Ala Thr Phe He Lys Arg Phe Pro His Asn Pro Arg Asp Trp Asp Arg 

865 870 875 



CTC CCC TAC CTC AAC CTA CCA TAC CTT CCA CCC AAT GCA GGA CGC CAG 

Leu Pro Tyr Leu Asn Leu Pro Tyr Leu Pro Pro Asn Ala Gly Arg Gin 
880 885 890 



2377 
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TAC CAC CTT GCC ATG GCT GCA TCA GAG TTC AAA GAG ACC CCC GAA CTC 2425 

Tyr His Leu Ala Met Ala Ala Ser Glu Phe Lys Glu Thr Pro Glu Leu 

895 900 905 910 

GAG AGT GCC GTC AGA GCA ATG GAA GCA GCA GCC AAC GTG GAC CCA CTA 2473 
Glu Ser Ala Val Arg Ala Met Glu Ala Ala Ala Asn Val Asp Pro Leu 

915 920 



TTC CAA TCT GCA CTC AGT GTG TTC ATG TGG CTG GAA GAG AAT GGG ATT 2521 
Phe Gin Ser Ala Leu Ser Val Phe Met Tip Leu Glu Glu Asn Gly lie 

930 935 940 

GTG ACT GAC ATG GCC AAC TTC GCA CTC AGC GAC CCG AAC GCC CAT CGG 2569 
Val Thr Asp Met Ala Asn Phe Ala Leu Ser Asp Pro Asn Ala His Arg 
945 950 955 

ATG CGA AAT TTT CTT GCA AAC GCA CCA CAA GCA GGC AGC AAG TCG CAA 2617 
Met Arg Asn Phe Leu Ala Asn Ala Pro Gin Ala Gly Ser Lys Ser Gin 
960 965 970 

AGG GCC AAG TAC GGG ACA GCA GGC TAC GGA GTG GAG GCT CGG GGC CCC 2665 
Arg Ala Lys Tyr Gly Thr Ala Gly Tyr Gly Val Glu Ala Arg Gly Pro 
975 980 985 990 

_ 

ACA CCA GAG GAA GCA CAG AGG GAA AAA GAC ACA CGG ATC TCA AAG AAG 2713 

Thr Pro Glu Glu Ala Gin Arg Glu Lys Asp Thr Arg He Ser Lys Lys 

1000 1005 



ATG GAG ACC ATG GGC ATC TAC TTT GCA ACA CCA GAA TGG GTA GCA CTC 2761 
Met Glu Thr Met Gly He Tyr Phe Ala Thr Pro Glu Trp Val Ala Leu 

1010 1015 1020 

AAT GGG CAC CGA GGG CCA AGC CCC GGC CAG CTA AAG TAC TGG CAG AAC 2809 

Asn Gly His Arg Gly Pro Ser Pro Gly Gin Leu Lys Tyr Trp Gin Asn 
1025 1030 1035 

ACA CGA GAA ATA CCG GAC CCA AAC GAG GAC TAT CTA GAC TAC GTG CAT 2857 
Thr Arg Glu He Pro Asp Pro Asn Glu Asp Tyr Leu Asp Tyr Val His 
1040 1045 1050 

GCA GAG AAG AGC CGG TTG GCA TCA GAA GAA CAA ATC CTA AGG GCA GCT 2905 
Ala Glu Lys Ser Arg Leu Ala Ser Glu Glu Gin He Leu Arg Ala Ala 
1055 1060 1065 1070 

ACG TCG ATC TAC GGG GCT CCA GGA CAG GCA GAG CCA CCC CAA GCT TTC 2953 
Thr Ser He Tyr Gly Ala Pro Gly Gin Ala Glu Pro Pro Gin Ala Phe 

1075 1080 1085 

ATA GAC GAA GTT GCC AAA GTC TAT GAA ATC AAC CAT GGA CGT GGC CCA 3001 
He Asp Glu Val Ala Lys Val Tyr Glu He Asn His Gly Arg Gly Pro 

1090 1095 1100 
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AAC CAA GAA CAG ATG AAA GAT CTG CTC TTG ACT GCG ATG GAG ATG AAG 3049 
Asn Gin Glu Gin Met Lys Asp Leu Leu Leu Thr Ala Met Glu Met Lys 
1105 1110 1115 

CAT CGC AAT CCC AGG CGG GCT CTA CCA AAG CCC AAG CCA AAA CCC AAT 3097 
His Arg Asn Pro Arg Arg Ala Leu Pro Lys Pro Lys Pro Lys Pro Asn 
1120 1125 1130 

GCT CCA ACA CAG AGA CCC CCT GGT CGG CTG GGC CGC TGG ATC AGG ACC 3145 
Ala Pro Thr Gin Arg Pro Pro Gly Arg Leu Gly Arg Trp lie Arg Thr 

1135 1140 H45 ii5 0 

GTC TCT GAT GAG GAC CTT GAG TGAGGCTCCT GGGAGTCTCC CGACACCACC 3196 
Val Ser Asp Glu Asp Leu Glu 

1155 

CGCGCAGGTG TGGACACCAA TTCGGCCTTA CAACATCCCA AATTGGATCC GTTCGCGGGT 3256 

m • ** * • » ■ ft 

CCCCT 



(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1012 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Met Thr Asn Leu Gin Asp Gin Thr Gin Gin He Val Pro Phe He Arg 
1 5 10 15 

Ser Leu Leu Met Pro Thr Thr Gly Pro Ala Ser He Pro Asp Asp Thr 

20 25 30 

Leu Glu Lys His Thr Leu Arg Ser Glu Thr Ser Thr Tyr Asn Leu Thr 

35 40 45 

Val Gly Asp Thr Gly Ser Gly Leu He Val Phe Phe Pro Gly Phe Pro 
50 55 60 

Gly Ser He Val Gly Ala His Tyr Thr Leu Gin Gly Asn Gly Asn Tyr 
65 70 75 80 

Lys Phe Asp Gin Met Leu Leu Thr Ala Gin Asn Leu Pro Ala Ser Tyr 

85 90 95 

Asn Tyr Cys Arg Leu Val Ser Arg Ser Leu Thr Val Arg Ser Ser Thr 

100 105 110 
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Leu Pro Gly Gly Val Tyr Ala Leu Asn Gly Thr He Asn Ala Val Thr 
115 120 125 



Phe Gin Gly 
130 



Leu Ser Glu Leu Thr Asp Val Ser Tyr Asn Gly Leu 
135 140 



Met 
145 



Ala Thr Ala Asn He Asn Asp Lys He Gly Asn Val Leu Val 

150 155 160 



Gly Glu Gly Val Thr Val Leu Ser Leu Pro Thr Ser Tyr Asp Leu Gly 

165 170 175 

Tyr Val Arg Leu Gly Asp Pro lie Pro Ala He Gly Leu Asp Pro Lys 

180 185 190 

Met Val Ala Thr Cys Asp Ser Ser Asp Arg Pro Arg Val Tyr Thr He 
195 200 205 

Thr Ala Ala Asp Asp Tyr Gin Phe Ser Ser Gin Tyr Gin Pro Gly Gly 
210 215 220 



Val Thr lie Thr Leu Phe 
225 230 



Ala Asn He Asp Ala He Thr 

235 



Leu 
240 



Ser Val Gly Gly Glu Leu Val Phe Gin Thr Ser Val His Gly Leu Val 

245 250 



Leu Gly Ala Thr He Tyr Leu He Gly Phe Asp Gly Thr Thr Val He 

260 265 270 

Thr Arg Ala Val Ala Ala Asn Asn Gly Leu Thr Thr Gly Thr Asp Asn 
275 280 285 

Leu Met Pro Phe Asn Leu Val He Pro Thr Asn Glu He Thr Gin Pro 
290 295 300 

He Thr Ser He Lys Leu Glu He Val Thr Ser Lys Ser Gly Gly Gin 
305 310 315 320 

■ 

Ala Gly Asp Gin Met Ser Trp Ser Ala Arg Gly Ser Leu Ala Val Thr 

325 330 335 

He His Gly Gly Asn Tyr Pro Gly Ala Leu Arg Pro Val Thr Leu Val 

340 345 350 

Ala Tyr Glu Arg Val Ala Thr Gly Ser Val Val Thr Val Ala Gly Val 
355 360 365 

Ser Asn Phe Glu Leu He Pro Asn Pro Glu Leu Ala Lys Asn Leu Val 
370 375 380 

Thr Glu Tyr Gly Arg Phe Asp Pro Gly Ala Met Asn Tyr Thr Lys Leu 
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385 



390 



3 95 



400 



He Leu Ser Glu Arg Asp Arg Leu Gly He Lys Thr Val Trp Pro Thr 

405 410 415 

Arg Glu Tyr Thr Asp Phe Arg Glu Tyr Phe Met Glu Val Ala Asp Leu 

420 425 430 

Asn Ser Pro Leu Lys He Ala Gly Ala Phe Gly Phe Lys Asp He He 
435 440 445 



Arg Ala He Arg Arg He Ala Val Pro Val Val Ser Thr Leu Phe Pro 
450 455 460 



Pro Ala Ala Pro Leu Ala His Ala 
465 470 



Gly Glu Gly Val Asp Tyr Leu 

475 480 



Leu Gly Asp Glu Ala Gin Ala Ala Ser Gly Thr Ala Arg Ala Ala Ser 

485 490 495 

Gly Lys Ala Arg Ala Ala Ser Gly Arg He Arg Gin Leu Thr Leu Ala 

500 505 510 



Ala Asp Lys Gly Tyr Glu Val Val Ala Asn Leu Phe Gin Val Pro Gin 

520 525 



Asn Pro Val Val Asp Gly He Leu Ala Ser Pro Gly Val Leu Arg Gly 
530 535 540 



Ala His Asn Leu Asp Cys Val Leu Arg Glu Gly Ala Thr Leu Phe Pro 

550 555 560 



Val Val He Thr Thr Val Glu Asp Ala Met Thr Pro Lys Ala Leu Asn 

565 570 575 



Ser Lys Met Phe Ala Val He Glu Gly Val Arg Glu Asp Leu Gin Pro 

580 585 590 



Pro Ser Gin Arg Gly Ser Phe He Arg Thr Leu Ser Gly His Arg Val 
595 600 605 



Tyr Gly Tyr Ala Pro 
610 



Gly Val Leu Pro Leu Glu Thr Gly Arg Asp 
615 620 



Tyr Thr Val Val Pro He Asp Asp Val Trp Asp Asp Ser He Met Leu 

630 635 640 



Ser Lys Asp Pro He Pro Pro He Val Gly Asn Ser Gly Asn Leu Ala 

645 650 655 



He Ala Tyr Met Asp Val Phe Arg Pro Lys Val Pro He His Val Ala 

660 665 670 
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Met Thr Gly Ala Leu Asn Ala Cys Gly Glu lie Glu Lys Val 

675 680 685 



Ser Phe 



Arg Ser Thr Lys Leu Ala Thr Ala His Arg Leu Gly Leu Arg Leu Ala 
690 695 700 

Gly Pro Gly Ala Phe Asp Val Asn Thr Gly Pro Asn Trp Ala Thr Phe 

705 710 715 720 



lie Lys Arg 



Phe Pro His Asn Pro Arg Asp Trp Asp Arg Leu Pro Tyr 
725 730 735 



Leu Asn Leu Pro Tyr Leu Pro Pro Asn Ala Gly 

740 745 



Gin Tyr His Leu 
750 



> • *■ 



Ala Met Ala Ala Ser Glu Phe Lys Glu Thr Pro Glu Leu Glu 
755 760 765 



Ser Ala 



Val Arg Ala Met Glu Ala Ala Ala Asn Val Asp Pro Leu Phe Gin Ser 

770 775 780 



Ala Leu 
785 



Val Phe Met Trp Leu Glu Glu Asn Gly He Val Thr Asp 
790 795 800 



Met Ala Asn Phe Ala Leu Ser Asp Pro Asn Ala His Arg Met Arg Asn 

805 810 815 

Phe Leu Ala Asn Ala Pro Gin Ala Gly Ser Lys Ser Gin Arg Ala Lys 

820 825 830 



Tyr Gly Thr 
835 



Gly Tyr Gly Val Glu Ala 

840 



Gly Pro Thr Pro Glu 
845 



Glu Ala Gin Arg Glu Lys Asp Thr Arg He Ser Lys Lys Met Glu Thr 
850 855 860 



. m * 



Met Gly lie Tyr Phe Ala Thr Pro Glu Trp Val Ala Leu Asn Gly His 
865 870 875 880 



Arg Gly Pro Ser 



Pro Gly Gin Leu Lys Tyr Trp Gin Asn Thr Arg Glu 
885 890 895 



lie Pro Asp Pro Asn Glu Asp Tyr Leu Asp Tyr Val His Ala Glu Lys 

900 905 910 

Ser Arg Leu Ala Ser Glu Glu Gin He Leu Arg Ala Ala Thr Ser He 
915 920 925 

Tyr Gly Ala Pro Gly Gin Ala Glu Pro Pro Gin Ala Phe He Asp Glu 

930 935 940 

Val Ala Lys Val Tyr Glu He Asn His Gly Arg Gly Pro Asn Gin Glu 
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945 950 955 960 

Gin Met Lys Asp Leu Leu Leu Thr Ala Met Glu Met Lys His Arg Asn 

965 970 975 

Pro Arg Arg Ala Leu Pro Lys Pro Lys Pro Lys Pro Asn Ala Pro Thr 

980 985 990 

Gin Arg Pro Pro Gly Arg Leu Gly Arg Trp He Arg Thr Val Ser Asp 
995 1000 1005 

Glu Asp Leu Glu 
1010 

'(2) INFORMATION FOR* SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 97.. 531 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGATACGATC GGTCTGACCC CGGGGGAGTC ACCCGGGGAC AGGCCATCAC TGCCTTGTTC 60 
CTGGTTGGAA CTCCTCTTTC TGCTGTACTA TCGTTG ATG GTG AGT AGA GAT CAG 114 

Met Val Ser Arg Asp Gin 
1015 



ACA AAC GAT CGC AGC GAT GAC AAA CCT GAT GGA TCA CAC CCA ACA GAT 162 
Thr Asn Asp Arg Ser Asp Asp Lys Pro Asp Gly Ser His Pro Thr Asp 
1020 1025 1030 

- • » 

TGT TCC GTT CAT ACG GAG CCT TCT GAT GCC AAC GAC CGG ACC GGC GTC 210 
Cys Ser Val His Thr Glu Pro Ser Asp Ala Asn Asp Arg Thr Gly Val 
1035 1040 1045 1050 

CAT TCC GGA CGA CAC CCT GGA GAA GCA CAC ACT CAG GTC CGA AAC CTC 258 
His Ser Gly Arg His Pro Gly Glu Ala His Thr Gin Val Arg Asn Leu 

1055 1060 1065 

GAC TTA CAA CTT GAC TGT AGG GGA TAC AGG GTC AGG ACT AAT TGT CTT 306 
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Asp Leu Gin Leu Asp Cys Arg Gly Tyr Arg Val Arg Thr Asn Cys Leu 

1070 1075 1080 

TTT CCC TGG ATT CCC TGG TTC AGT TGT AGG TGC TCA CTA CAC ACT GCA 354 
Phe Pro Trp lie Pro Trp Phe Ser Cys Arg Cys Ser Leu His Thr Ala 
1085 1090 1095 

GAG CAG TGG GAA CTA CCA ATT CGA CCA GAT GCT CCT GAC AGC GCA GAA 402 
Glu Gin Trp Glu Leu Pro lie Arg Pro Asp Ala Pro Asp Ser Ala Glu 
1100 1105 1110 

CCT GCC TGC CAG CTA CAA CTA CTG CAG GCT AGT GAG CAG GAG TCT AAC 450 
Pro Ala Cys Gin Leu Gin Leu Leu Gin Ala Ser Glu Gin Glu Ser Asn 
1115 1120 1125 1130 

CGT ACG GTC AAG CAC ACT CCC TGG TGG CGT TTA TGC ACT AAA CGG AAC 498 
Arg Thr Val Lys His Thr Pro Trp Trp Arg Leu Cys Thr Lys Arg Asn 

1135 1140 1145 

CAT AAA CGC AGT GAC CTT CCA CGG AAG CCT GAG TGAGTTGACT GACTACAGCT 551 

His Lys Arg Ser Asp Leu Pro Arg Lys Pro Glu 

1150 1155 

ACAACGGGCT GATGTCAGCC ACTGCGAACA TCAACGACAA GATCGGGAAC GTTCTAGTTG 611 

GAGAAGGGGT GACTGTTCTC AGTCTACCGA CTTCATATGA CCTTAGTTAT GTGAGACTCG 671 

GTGACCCCAT CCCCGCAGCA GGACTCGACC CGAAGTTGAT GGCCACGTGC GACAGTAGTG 731 

ACAGACCCAG AGTCTACACC ATAACAGCTG CAGATGAATA CCAATTCTCG TCACAACTCA 791 

TCCCGAGTGG CGTGAAGACC ACACTGTTCT CCGCCAACAT CGATGCTCTC ACCAGCTTCA 851 

.... ■ ■ 

GCGTTGGTGG TGAGCTTGTC TTCAGCCAAG TAACGATCCA AAGCATTGAA GTGGACGTCA 911 

CCATTCACTT CATTGGGTTT GACGGGACAG ACGTAGCAGT CAAGGCAGTT GCAACAGACT 971 

TTGGGCTGAC AACTGGGACA AACAACCTTG TGCCATTCAA CCTGGTGGTC CCAACAAATG 1031 

AGATCACCCA GCCCATCACT TCCATGAAAC TAGAGGTTGT GACCTACAAG ATTGGCGGCA 1091 

CCGCTGGTGA CCCAATATCA TGGACAGTGA GTGGTACACT AGCTGTGACG GTGCACGGAG 1151 

GCAACTACCC TGGGGCTCTC CGTCCTGTCA CCCTGGTGGC CTATGAACGA GTGGCTGCAG 1211 

GATCTGTTGT CACAGTTGCA GGGGTGAGCA ACTTCGAGCT AATCCCCAAC CCTGAGCTTG 1271 

CAAAGAACCT AGTTACAGAG TATGGCCGCT TTGACCCCGG AGCAATGAAC TACACCAAAC 1331 

TAATACTGAG TGAGAGAGAT CGTCTAGGCA TCAAGACAGT CTGGCCCACC AGGGAGTACA 1391 

CCGATTTCAG GGAGTACTTC ATGGAGGTTG CAGATCTCAA CTCACCCCTA AAGATTGCAG 1451 
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GAGCATTTGG 


CTTTAAGGAC ATAATCCGAG CCATTPGGAA fil\T r vrzmn r rn ppn^r-nm^m 


1511 


CCACACTCTT 


CCCTCCAGCT GCACCCCTAG CACATGCAAT PGG AOA AfSf^T fTRPApm^rr. 


1571 


TCCTGGGCGA 


CGAGGCCCAA GCAGCCTCAG GGACAGPTPC; AnrrnrrsTna r»r»7\ * *»r.nmii 


mm mmm, mmm 

1631 


GAGCTGCCTC 


AGGACGAATA AGGPAGPTAA PTPTPflPACP Tr , 7\r , 7i Ar^rr^r* mnnr.* ^*** m . ,-, 

* p 


1691 

• 


TCGCCAACAT 

^ lm mw M • m> mm 


GTTCCAGGTG CCCCAGAATP PP ATT^TTn a Tnr*paTTPTn rTt ^____ _ 


1751 


GAATCCTGCG 


TGGCG CACAC AACCTCGACT GCGTGCTATG GGACZCZCZzarr aptpwpop 


*m Ajmk mm mm 

1811 


CTGTTGTCAT 


TACGACACTC GAGGATGAGC TGAPPPPPAA finrarTnaar ^r>pi\^^KmAm 

*.«.ww^»w*-»w a w unwn x wnvv X w<^Vw www L/Ul wVJwiL. X OxinL AVilwAAAATGT 


mm am n mm 

1871 


TTGCTGTCAT 


TGAAGGTGTG PG AH A(V2 A rr r rCT , l\rzcr rr rr > H awv^r'iv jvr»r»n /~»^ * mnnwmA* 
iunnwivjivj v^onvjn\JOrtL^ luUiuLLltl. AlLLLAALbG GGATCCTXCA 

• * 


1931 


TTCGAACTCT 


PTPT^OPPAT ZVr^AfZTPTZVPn nr tr ri\ r rnr t r'r i r* Tir* Tir^r^r* * rtfnno«m^m«« 
w x LiuuLuiA iLi/iiu owlAlowwww AbALbGAGTA CTGCCTCTGG 


1991 


AGACCGGGAG 


AGAPTAPAPP fiTTVTT'PPPZV A T'TriiyTniiTnT r , TW | r*A/"v^**T' Ttnn«mn 

x nurt^u uiiui LLLhH x 1 ori 1 bA lul \j 1 wHjoAwCjAT AGCATAAx GC 


2051 


TGTCGCAGGA 


PPPPATAPPT PPAATP2i*T*ZVf2 nnnararr^p r , aar»r»Tivr«r«r» TiT^nr^mv /-«■» 

v~.wwW"lx.*\ww ± LLAAlUilAVJ Ute/i/v w/1 w» Loo wiALLlAuLL AT AG CAT ACA 


2111 


TGGATGTCTT 


uauululaao Vjlwwww/ixww ALblbuHAl bAUiubvjGCC CTCAATGCCC 

. ' - " ■ • 


2171 


GCGGTGAGAT 


Pf^ArSAnnY^'TT APfSTTPP/'SPA f2P'Rr , P , AA7\ /""""P p*p»/"*p» Anivnrin nunn^^/immM 
wvAOAVJAVjl X ALwUvLwUi olAlLftAALl LbLLAUAbCC CACCGACTTG 


2231 


GCATGAAGTT 


• • * 

AGPTGfiTPPT RRAf5rPT2VTn l\Pl\ f PT , l\A r P2l^ , anpapr'TR Tir(r»n/ i i>ikn/<im 
nv3V - xvjvax ww i uwiov-vlAlv ALA! 1AA1HL AuoAL w 1 AAL TGGGCAACGT 


mmm. mmm, Mmm. mm 

2291 


T CGTCAAACG 


TTfPPPTPAP AATPPPP^A^2 A f^Vfinr* RPAr /■"p^rT»r , p»riir» r«nn » -r . m^.., . 
xxx www X UiL AhlLLULuAw AL 1 buuALAb \j 1 1 u»C wvJTAC CTCAACCTTC 

< • 


2351 


CTTATCTCCC 


APPAAPAttPA f^T^APn'PPArST TPPIiTPT&PP pp*rr!Pr"r/ , »r'r» T»nnnn/iwfii/in 
uwunuwi wnLu X LAu X 1 w LA X w X HbL L.L. X buL IbLL TCCGAG TTCA 


2411 


AAGAGACCCC 


/iumu x wu/tn uawul x vj x vjl b^uUUi x bbA 1 Vj w wOw x bbA AATGCCGACC 

• 


mf^k mm m~~m mm 

2471 


CATTGTTCCG 


CTCAGCTCTC CAGGTCTTCA TGTGGTTGGA AGAAAACGGG ATTGTGACCG 


2531 


ACATGGCTAA 


CTTCGCCCTC AGCGACCCAA ACGCGCATAG GATGAAAAAC TTCCTAGCAA 


2591 


ACGCACCCCA GGCTGGAAGC AAGTCGCAGA GGGCCAAGTA TGGCACGGCA GGCTACGGAG 


mmm. _mm mmm m% 

2651 


TGGAGGCTCG 


AGGCCCCACA CCAGAAGAGG CACAGAGGGA AAAAGACACA CGGATCTCCA 


0™\ mm mm m 

2711 


AGAAGATGGA 


AACAATGGGC ATCTACTTCG CGACACCGGA ATGGGTGGCT CTCAACGGGC 


2771 


ACCGAGGCCC 


AAGCCCCGGC CAACTCAAGT ACTGGCAAAA CACAAGAGAA ATACCAGAGC 


2831 


CCAATGAGGA 


CTACCCAGAC TATGTGCACG CGGAGAAGAG CCGGTTGGCG TCAGAAGAAC 


2891 


AGATCCTACG 


GGCAGCCACG TCGATCTACG GGGCTCCAGG ACAGGCTGAA CCACCCCAGG 


2951 


CCTTCATAGA 


CGAGGTCGCC AGGGTCTATG AAATCAACCA TGGGCGTGGT CCAAACCAGG 


3011 
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AGCAGATGAA GGACCTGCTC CTGACTGCGA TGGAGATGAA GCATCGCAAT CCCAGGCGGG 3071 

CTCCACCAAA GCCAAAGCCA AAACCCAATG CTCCATCACA GAGACCCCCT GGACGGCTGG 3131 

GCCGCTGGAT CAGGACGGTC TCCGACGAGG ACTTGGAGTG AGGCTCCTGG GAGTCTCCCG 3191 

ACACTACCCG CGCAGGTGTG GACACCAATT CGGCCTTCTA CCATCCCAAA TTGGATCCGT 3251 

TCGCGGGTCC CCT 3264 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Met Val Ser Arg Asp Gin Thr Asn Asp Arg Ser Asp Asp Lys Pro Asp 
1 5 10 15 

Gly Ser His Pro Thr Asp Cys Ser Val His Thr Glu Pro Ser Asp Ala 

20 25 30 

Asn Asp Arg Thr Gly Val His Ser Gly Arg His Pro Gly Glu Ala His 

35 40 45 

Thr Gin Val Arg Asn Leu Asp Leu Gin Leu Asp Cys Arg Gly Tyr Arg 
50 55 60 

Val Arg Thr Asn Cys Leu Phe Pro Trp lie Pro Trp Phe Ser Cys Arg 
65 70 75 80 

Cys Ser Leu His Thr Ala Glu Gin Trp Glu Leu Pro lie Arg Pro Asp 

85 90 95. 

Ala Pro Asp Ser Ala Glu Pro Ala Cys Gin Leu Gin Leu Leu Gin Ala 

100 105 110 

Ser Glu Gin Glu Ser Asn Arg Thr Val Lys His Thr Pro Trp Trp Arg 
115 120 125 

Leu Cys Thr Lys Arg Asn His Lys Arg Ser Asp Leu Pro Arg Lys Pro 
130 135 140 

Glu 
145 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 131.. 3169 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GGATACGATC GGTCTGACCC CGGGGGAGTC ACCCGGGGAC AGGCCATCAC TGCCTTGTTC 60 
CTGGTTGGAA CTCCTCTTTC TGCTGTACTA TCGTTGATGG TGAGTAGAGA TCAGACAAAC 120 

GATCGCAGCG ATG ACA AAC CTG ATG GAT CAC ACC CAA CAG ATT GTT CCG 169 

Met Thr Asn Leu Met Asp His Thr Gin Gin lie Val Pro 

150 155 

TTC ATA CGG AGC CTT CTG ATG CCA ACG ACC GGA CCG GCG TCC ATT CCG 217 
Phe lie Arg Ser Leu Leu Met Pro Thr Thr Gly Pro Ala Ser lie Pro 
160 165 170 

GAC GAC ACC CTG GAG AAG CAC ACA CTC AGG TCC GAA ACC TCG ACT TAC 265 
Asp Asp Thr Leu Glu Lys His Thr Leu Arg Ser Glu Thr Ser Thr Tyr 
175 1B0 185 190 

AAC TTG ACT GTA GGG GAT ACA GGG TCA GGA CTA ATT GTC TTT TTC CCT 313 
Asn Leu Thr Val Gly Asp Thr Gly Ser Gly Leu lie Val Phe Phe Pro 

195 200 205 

GGA TTC CCT GGT TCA GTT GTA GGT GCT CAC TAC ACA CTG CAG AGC AGT 361 
Gly Phe Pro Gly Ser Val Val Gly Ala His Tyr Thr Leu Gin Ser Ser 

210 215 220 

GGG AAC TAC CAA TTC GAC CAG ATG CTC CTG ACA GCG CAG AAC CTG CCT 409 
Gly Asn Tyr Gin Phe Asp Gin Met Leu Leu Thr Ala Gin Asn Leu Pro 
225 230 235 



GCC AGC TAC AAC TAC TGC AGG CTA GTG AGC AGG AGT CTA ACC GTA CGG 

Ala Ser Tyr Asn Tyr Cys Arg Leu Val Ser Arg Ser Leu Thr Val Arg 
240 245 250 



457 



55 

TCA AGC ACA CTC CCT GGT GGC GTT TAT GCA CTA AAC GGA ACC ATA AAC 
Ser Ser Thr Leu Pro Gly Gly Val Tyr Ala Leu Asn Gly Thr lie Asn 

260 265 270 



GCA GTG ACC TTC CAC GGA AGC CTG AGT GAG TTG ACT GAC TAC AGC TAC 
Ala Val Thr Phe His Gly Ser Leu Ser Glu Leu Thr Asp Tyr Ser Tyr 

275 280 285 



505 



AAC GGG CTG ATG TCA GCC ACT GCG AAC ATC AAC GAC AAG ATC GGG AAC 
Asn Gly Leu Met Ser Ala Thr Ala Asn lie Asn Asp Lys He Gly Asn 

290 295 300 



601 



GTT CTA GTT GGA GAA GGG GTG ACT GTT CTC AGT CTA CCG ACT TCA TAT 
Val Leu Val Gly Glu Gly Val Thr Val Leu Ser Leu Pro Thr Ser Tyr 
305 310 315 

GAC CTT AGT TAT GTG AGA CTC GGT GAC CCC ATC CCC GCA GCA GGA CTC 
Asp Leu Ser Tyr Val Arg Leu Gly Asp Pro He Pro Ala Ala Gly Leu 

320 325 330 

GAC CCG AAG TTG ATG GCC ACG TGC GAC AGT AGT GAC AGA CCC AGA GTC 

Asp Pro Lys Leu Met Ala Thr Cys Asp Ser Ser Asp Arg Pro Arg Val 

340 345 350 



TAC ACC ATA ACA GCT GCA GAT GAA TAC CAA TTC TCG TCA CAA CTC ATC 
Tyr Thr He Thr Ala Ala Asp Glu Tyr Gin Phe Ser Ser Gin Leu He 

355 360 365 

CCG AGT GGC GTG AAG ACC ACA CTG TTC TCC GCC AAC ATC GAT GCT CTC 
Pro Ser Gly Val Lys Thr Thr Leu Phe Ser Ala Asn He Asp Ala Leu 

370 375 380 

ACC AGC TTC AGC GTT GGT GGT GAG CTT GTC TTC AGC CAA GTA ACG ATC 
Thr Ser Phe Ser Val Gly Gly Glu Leu Val Phe Ser Gin Val Thr He 

390 



CAA AGC ATT GAA GTG GAC GTC ACC ATT CAC TTC ATT GGG TTT GAC GGG 
Gin Ser He Glu Val Asp Val Thr He His Phe He Gly Phe Asp Gly 
400 405 410 

ACA GAC GTA GCA GTC AAG GCA GTT GCA ACA GAC TTT GGG CTG ACA ACT 
Thr Asp Val Ala Val Lys Ala Val Ala Thr Asp Phe Gly Leu Thr Thr 
415 420 425 430 

GGG ACA AAC AAC CTT GTG CCA TTC AAC CTG GTG GTC CCA ACA AAT GAG 
Gly Thr Asn Asn Leu Val Pro Phe Asn Leu Val Val Pro Thr Asn Glu 

435 440 445 

ATC ACC CAG CCC ATC ACT TCC ATG AAA CTA GAG GTT GTG ACC TAC AAG 
He Thr Gin Pro He Thr Ser Met Lys Leu Glu Val Val Thr Tyr Lys 

450 455 460 



649 



697 



745 



793 



841 



889 



937 



985 



1033 



1081 
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ATT GGC GGC ACC GCT GGT GAC CCA ATA TCA TGG ACA GTG AGT GGT ACA 1129 
He Gly Gly Thr Ala Gly Asp Pro He Ser Trp Thr Val Ser Gly Thr 
465 470 475 

CTA GCT GTG ACG GTG CAC GGA GGC AAC TAC CCT GGG GCT CTC CGT CCT 1177 

Leu Ala Val Thr Val His Gly Gly Asn Tyr Pro Gly Ala Leu Arg Pro 

480 485 490 

GTC ACC CTG GTG GCC TAT GAA CGA GTG GCT GCA GGA TCT GTT GTC ACA 1225 
Val Thr Leu Val Ala Tyr Glu Arg Val Ala Ala Gly Ser Val Val Thr 
495 500 505 510 

GTT GCA GGG GTG AGC AAC TTC GAG CTA ATC CCC AAC CCT GAG CTT GCA 1273 
Val Ala Gly Val Ser Asn Phe Glu Leu He Pro Asn Pro Glu Leu Ala 

520 525 



AAG AAC CTA GTT ACA GAG TAT GGC CGC TTT GAC CCC GGA GCA ATG AAC ' 1321 
Lys Asn Leu Val Thr Glu Tyr Gly Arg Phe Asp Pro Gly Ala Met Asn 

530 535 540 

TAC ACC AAA CTA ATA CTG AGT GAG AGA GAT CGT CTA GGC ATC AAG ACA 1369 
Tyr Thr Lys Leu He Leu Ser Glu Arg Asp Arg Leu Gly He Lys Thr 
545 550 



GTC TGG CCC ACC AGG GAG TAC ACC GAT TTC AGG GAG TAC TTC ATG GAG 1417 
Val Trp Pro Thr Arg Glu Tyr Thr Asp Phe Arg Glu Tyr Phe Met Glu 
560 565 570 

GTT GCA GAT CTC AAC TCA CCC CTA AAG ATT GCA GGA GCA TTT GGC TTT 1465 
Val Ala Asp Leu Asn Ser Pro Leu Lys He Ala Gly Ala Phe Gly Phe 
575 580 585 590 

AAG GAC ATA ATC CGA GCC ATT CGG AAG ATT GCG GTG CCA GTG GTA TCC 1513 
Lys Asp lie He Arg Ala He Arg Lys He Ala Val Pro Val Val Ser 

595 600 605 

ACA CTC TTC CCT CCA GCT GCA CCC CTA GCA CAT GCA ATC GGA GAA GGT 1561 

Thr Leu Phe Pro Pro Ala Ala Pro Leu Ala His Ala lie Gly Glu Gly 

610 615 620 

GTA GAC TAC CTC CTG GGC GAC GAG GCC CAA GCA GCC TCA GGG ACA GCT 1609 
Val Asp Tyr Leu Leu Gly Asp Glu Ala Gin Ala Ala Ser Gly Thr Ala 

630 635 



CGA GCC GCG TCA GGA AAA GCT AGA GCT GCC TCA GGA CGA ATA AGG CAG 1657 
Arg Ala Ala Ser Gly Lys Ala Arg Ala Ala Ser Gly Arg He Arg Gin 
640 645 650 



CTA ACT CTC GCA GCT GAC AAG GGG TGC GAG GTA GTC GCC AAC ATG TTC 1705 
Leu Thr Leu Ala Ala Asp Lys Gly Cys Glu Val Val Ala Asn Met Phe 
655 660 665 670 
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CAG GTG CCC CAG AAT CCC ATT GTT GAT GGC ATT CTG GCA TCC CCA GGA 1753 
Gin Val Pro Gin Asn Pro He Val Asp Gly He Leu Ala Ser Pro Gly 

675 680 685 

ATC CTG CGT GGC GCA CAC AAC CTC GAC TGC GTG CTA TGG GAG GGA GCC 1801 
He Leu Arg Gly Ala His Asn Leu Asp Cys Val Leu Tip Glu Gly Ala 

690 695 700 

ACT CTT TTC CCT GTT GTC ATT ACG ACA CTC GAG GAT GAG CTG ACC CCC 1849 
Thr Leu Phe Pro Val Val He Thr Thr Leu Glu Asp Glu Leu Thr Pro 
705 710 715 

AAG GCA CTG AAC AGC AAA ATG TTT GCT GTC ATT GAA GGT GTG CGA GAG 1897 
Lys Ala Leu Asn Ser Lys Met Phe Ala Val He Glu Gly Val Arg Glu 
720 725 730 

GAC CTC CAG CCT CCA TCC CAA CGG GGA TCC TTC ATT CGA ACT CTC TCT 1945 
Asp Leu Gin Pro Pro Ser Gin Arg Gly Ser Phe He Arg Thr Leu Ser 
735 740 745 750 

GGC CAT AGA GTC TAT GGC TAT GCC CCA GAC GGA GTA CTG CCT CTG GAG 1993 
Gly His Arg Val Tyr Gly Tyr Ala Pro Asp Gly Val Leu Pro Leu Glu 

755 760 765 

ACC GGG AGA GAC TAC ACC GTT GTC CCA ATT GAT GAT GTG TGG GAC GAT 2041 
Thr Gly Arg Asp Tyr Thr Val Val Pro He Asp Asp Val Tip Asp Asp 

770 775 780 

AGC ATA ATG CTG TCG CAG GAC CCC ATA CCT CCA ATC ATA GGG AAC AGC 2089 
Ser He Met Leu Ser Gin Asp Pro He Pro Pro He He Gly Asn Ser 
785 790 795 

GGC AAC CTA GCC ATA GCA TAC ATG GAT GTC TTC AGG CCC AAG GTC CCC 2137 

Gly Asn Leu Ala He Ala Tyr Met Asp Val Phe Arg Pro Lys Val Pro 

800 805 810 

ATC CAC GTG GCT ATG ACA GGG GCC CTC AAT GCC CGC GGT GAG ATC GAG 2185 

He His Val Ala Met Thr Gly Ala Leu Asn Ala Arg Gly Glu He Glu 

815 820 825 830 

AGT GTT ACG TTC CGC AGC ACC AAA CTC GCC ACA GCC CAC CGA CTT GGC 2233 
Ser Val Thr Phe Arg Ser Thr Lys Leu Ala Thr Ala His Arg Leu Gly 

835 840 845 

ATG AAG TTA GCT GGT CCT GGA GCC TAT GAC ATT AAT ACA GGA CCT AAC 2281 
Met Lys Leu Ala Gly Pro Gly Ala Tyr Asp He Asn Thr Gly Pro Asn 

850 855 860 

TGG GCA ACG TTC GTC AAA CGT TTC CCT CAC AAT CCC CGA GAC TGG GAC 2329 
Trp Ala Thr Phe Val Lys Arg Phe Pro His Asn Pro Arg Asp Trp Asp 
865 870 875 
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AGG TTG CCC TAC CTC AAC CTT CCT TAT CTC CCA CCA ACA GCA GGA CGT 2377 

Arg Leu Pro Tyr Leu Asn Leu Pro Tyr Leu Pro Pro Thr Ala Gly Arg 

880 885 890 

CAG TTC CAT CTA GCC CTG GCT GCC TCC GAG TTC AAA GAG ACC CCA GAA 2425 
Gin Phe His Leu Ala Leu Ala Ala Ser Glu Phe Lys Glu Thr Pro Glu 
895 900 905 910 

* 

CTC GAA GAC GCT GTG CGC GCA ATG GAT GCC GCT GCA AAT GCC GAC CCA 2473 
Leu Glu Asp Ala Val Arg Ala Met Asp Ala Ala Ala Asn Ala Asp Pro 

915 920 925 

TTG TTC CGC TCA GCT CTC CAG GTC TTC ATG TGG TTG GAA GAA AAC GGG 2521 
Leu Phe Arg Ser Ala Leu Gin Val Phe Met Trp Leu Glu Glu Asn Gly 

330 935 940 

ATT GTG ACC GAC ATG GCT AAC TTC GCC CTC AGC GAC CCA AAC GCG CAT 2569 
lie Val Thr Asp Met Ala Asn Phe Ala Leu Ser Asp Pro Asn Ala His 
945 950 



AGG ATG AAA AAC TTC CTA GCA AAC GCA CCC CAG GCT GGA AGC AAG TCG 2617 
Arg Met Lys Asn Phe Leu Ala Asn Ala Pro Gin Ala Gly Ser Lys Ser 
960 965 970 

CAG AGG GCC AAG TAT GGC ACG GCA GGC TAC GGA GTG GAG GCT CGA GGC 2665 
Gin Arg Ala Lys Tyr Gly Thr Ala Gly Tyr Gly Val Glu Ala Arg Gly 
975 980 985 990 

CCC ACA CCA GAA GAG GCA CAG AGG GAA AAA GAC ACA CGG ATC TCC AAG 2713 
Pro Thr Pro Glu Glu Ala Gin Arg Glu Lys Asp Thr Arg lie Ser Lys 

995 1000 1005 

AAG ATG GAA ACA ATG GGC ATC TAC TTC GCG ACA CCG GAA TGG GTG GCT 2761 
Lys Met Glu Thr Met Gly He Tyr Phe Ala Thr Pro Glu Trp Val Ala 

1010 1015 1020 

CTC AAC GGG CAC CGA GGC CCA AGC CCC GGC CAA CTC AAG TAC TGG CAA 2809 

Leu Asn Gly His Arg Gly Pro Ser Pro Gly Gin Leu Lys Tyr Trp Gin 
1025 1030 1035 

AAC ACA AGA GAA ATA CCA GAG CCC AAT GAG GAC TAC CCA GAC TAT GTG 2857 
Asn Thr Arg Glu He Pro Glu Pro Asn Glu Asp Tyr Pro Asp Tyr Val 
1040 1045 1050 



CAC 


GCG 


GAG 


AAG 


AGC CGG TTG 


GCG 


TCA 


GAA 


GAA 


CAG 


ATC 


CTA 


CGG 


GCA 


2905 


His 


Ala 


Glu 


Lys 


Ser Arg Leu 


Ala 


Ser 


Glu 


Glu 


Gin 


He 


Leu 


Arg 


Ala 




1055 






1060 








1065 








1070 




GCC 


ACG 


TCG 


ATC 


TAC GGG GCT 


CCA 


GGA 


CAG 


GCT 


GAA 


CCA 


CCC 


CAG 


GCC 


2953 


Ala 


Thr 


Ser 


He 


Tyr Gly Ala 


Pro 


Gly 


Gin 


Ala 


Glu 


Pro 


Pro 


Gin 


Ala 












1075 






1080 








1085 
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TTC ATA GAC GAG GTC GCC AGG GTC TAT GAA ATC AAC CAT GGG GGT GGT 3001 
Phe He Asp Glu Val Ala Arg Val Tyr Glu He Asn His Gly Arg Gly 

1090 1095 1100 

CCA AAC CAG GAG CAG ATG AAG GAC CTG CTC CTG ACT GCG ATG GAG ATG 3049 

Pro Asn Gin Glu Gin Met Lys Asp Leu Leu Leu Thr Ala Met Glu Met 
1105 IHO 1115 

AAG CAT CGC AAT CCC AGG CGG GCT CCA CCA AAG CCA AAG CCA AAA CCC 3097 
Lys His Arg Asn Pro Arg Arg Ala Pro Pro Lys Pro Lys Pro Lys Pro 
1120 H25 H30 

AAT GCT CCA TCA CAG AGA CCC CCT GGA CGG CTG GGC CGC TGG ATC AGG 3145 

Asn Ala Pro Ser Gin Arg Pro Pro Gly Arg Leu Gly Arg Trp He Arg 
1135 1140 H45 1150 

ACG GTC TCC GAC GAG GAC TTG GAG TGAGGCTCCT GGGAGTCTCC CGACACTACC 3199 
Thr Val Ser Asp Glu Asp Leu Glu 



CGCGCAGGTG TGGACACCAA TTCGGCCTTC TACCATCCCA AATTGGATCC GTTCGCGGGT 3259 



CCCCT 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Thr Asn Leu Met Asp His Thr Gin Gin He Val Pro Phe He Arg 
15 10 15 

Ser.Leu Leu Met Pro Thr Thr Gly Pro Ala Ser He Pro Asp Asp Thr 

20 25 30 

Leu Glu Lys His Thr Leu Arg Ser Glu Thr Ser Thr Tyr Asn Leu Thr 

35 40 45 

Val Gly Asp Thr Gly Ser Gly Leu He Val Phe Phe Pro Gly Phe Pro 
50 55 60 

Gly Ser Val Val Gly Ala His Tyr Thr Leu Gin Ser Ser Gly Asn Tyr 
65 70 75 80 



3264 



Gin Phe Asp Gin Met Leu Leu Thr Ala Gin Asn Leu Pro Ala Ser Tyr 
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85 90 95 

Asn Tyr Cys Arg Leu Val Ser Arg Ser Leu Thr Val Arg Ser Ser Thr 

100 105 110 

Leu Pro Gly Gly Val Tyr Ala Leu Asn Gly Thr lie Asn Ala Val Thr 
115 120 125 

Phe His Gly Ser Leu Ser Glu Leu Thr Asp Tyr Ser Tyr Asn Gly Leu 
130 135 140 

Met Ser Ala Thr Ala Asn lie Asn Asp Lys lie Gly Asn Val Leu Val 
145 150 155 160 

Gly Glu Gly Val Thr Val Leu Ser Leu Pro Thr Ser Tyr Asp Leu Ser 

165 170 175 

Tyr Val Arg Leu Gly Asp Pro He Pro Ala Ala Gly Leu Asp Pro Lys 

180 185 190 

Leu Met Ala Thr Cys Asp Ser Ser Asp Arg Pro Arg Val Tyr Thr He 

200 205 



Thr Ala Ala Asp Glu Tyr Gin Phe Ser Ser Gin Leu He Pro Ser Gly 
210 215 220 

Val Lys Thr Thr Leu Phe Ser Ala Asn He Asp Ala Leu Thr Ser Phe 

230 235 240 



Ser Val Gly Gly Glu Leu Val Phe Ser Gin Val Thr He Gin Ser lie 

245 250 



Glu Val Asp Val Thr He His Phe He Gly Phe Asp Gly Thr Asp Val 

260 265 270 

« * • 

Ala Val Lys Ala Val Ala Thr Asp Phe Gly Leu Thr Thr Gly Thr Asn 
275 280 285 

Asn Leu Val Pro Phe Asn Leu Val Val Pro Thr Asn Glu He Thr Gin 
290 295 300 

Pro He Thr Ser Met Lys Leu Glu Val Val Thr Tyr Lys He Gly Gly 
305 310 315 320 

Thr Ala Gly Asp Pro He Ser Trp Thr Val Ser Gly Thr Leu Ala Val 

325 330 335 

Thr Val His Gly Gly Asn Tyr Pro Gly Ala Leu Arg Pro Val Thr Leu 

340 345 350 



Val Ala Tyr Glu Arg Val Ala Ala Gly Ser Val Val Thr Val Ala Gly 
355 360 365 
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Val Ser Asn Phe Glu 
370 

Val Thr Glu Tyr Gly 
385 

Leu lie Leu Ser Glu 

405 

Thr Arg Glu Tyr Thr 

420 

Leu Asn Ser Pro Leu 
435 

lie Arg Ala lie Arg 
450 

Pro Pro Ala Ala Pro 
465 



Leu Leu Gly Asp Glu 

485 

Ser Gly Lys Ala Arg 

500 

Ala Ala Asp Lys Gly 
515 

Gin Asn Pro lie Val 
530 • 

■ 

Gly Ala His Asn Leu 
545 

Pro Val Val He Thr 

565 

Asn Ser Lys Met Phe 

580 

Pro Pro Ser Gin Arg 
595 

Val Tyr Gly Tyr Ala 
610 

Asp Tyr Thr Val Val 
625 
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Leu He Pro Asn Pro Glu 
375 

Arg Phe Asp Pro Gly Ala 
390 395 

Arg Asp Arg Leu Gly He 

410 

Asp Phe Arg Glu Tyr Phe 

425 

Lys He Ala Gly Ala Phe 
440 

Lys He Ala Val Pro Val 
455 

Leu Ala His Ala He Gly 
470 475 



Ala Gin Ala Ala Ser Gly 

490 

Ala Ala Ser Gly Arg He 

505 

Cys Glu Val Val Ala Asn 
520 

Asp Gly He Leu Ala Ser 
535 

Asp Cys Val Leu Trp Glu 
550 555 

Thr Leu Glu Asp Glu Leu 

570 

Ala Val He Glu Gly Val 

585 

Gly Ser Phe He Arg Thr 
600 

Pro Asp Gly Val Leu Pro 
615 

Pro He Asp Asp Val Trp 
630 635 



Leu Ala Lys Asn Leu 
380 

Met Asn Tyr Thr Lys 

400 

Lys Thr Val Trp Pro 

415 

Met Glu Val Ala Asp 

430 

Gly Phe Lys Asp He 
445 

Val Ser Thr Leu Phe 
460 

Glu Gly Val Asp Tyr 

480 



Thr Ala Arg Ala Ala 

495 

Arg Gin Leu Thr Leu 

510 

Met Phe Gin Val Pro 
525 

Pro Gly He Leu Arg 
540 

Gly Ala Thr Leu Phe 

560 

Thr Pro Lys Ala Leu 

575 

Arg Glu Asp Leu Gin 
590 

Leu Ser Gly His Arg 
605 

Leu Glu Thr Gly Arg 
620 

Asp Asp Ser He Met 

640 
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Leu Ser Gin Asp 



Ala lie Ala Tyr 

660 



Pro lie Pro Pro 

645 

Met Asp Val Phe 
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lie lie Gly Asn 
650 

Arg Pro Lys Val 
665 



Ser Gly Asn Leu 
655 

Pro lie His Val 
670 



Ala Met Thr Gly Ala Leu Asn Ala Arg Gly Glu lie Glu Ser Val Thr 
675 680 685 

Phe Arg Ser Thr Lys Leu Ala Thr Ala His Arg Leu Gly Met Lys Leu 
690 695 700 

Ala Gly Pro Gly Ala Tyr Asp lie Asn Thr Gly Pro Asn Trp Ala Thr 

705 710 715 720 

Phe Val Lys Arg Phe Pro His Asn Pro Arg Asp Trp Asp Arg Leu Pro 

725 730 735 

Tyr Leu Asn Leu Pro Tyr Leu Pro Pro Thr Ala Gly Arg Gin Phe His 

740 745 750 

Leu Ala Leu Ala Ala Ser Glu Phe Lys Glu Thr Pro Glu Leu Glu Asp 
755 760 765 



Ala Val Arg Ala Met Asp Ala Ala Ala Asn Ala Asp Pro Leu Phe Arg 

770 775 780 

Ser Ala Leu Gin Val Phe Met Trp Leu Glu Glu Asn Gly lie Val Thr 
785 790 795 800 



Asp Met Ala Asn 



Phe Ala Leu 
805 



Asp Pro Asn Ala His Arg Met Lys 
810 815 



Asn Phe Leu Ala Asn Ala Pro Gin Ala Gly Ser Lys Ser Gin Arg Ala 

820 825 830 

Lys Tyr Gly Thr Ala Gly Tyr Gly Val Glu Ala Arg Gly Pro Thr Pro 

835 840 845 



Glu Glu Ala Gin Arg Glu Lys Asp Thr Arg 
850 855 



Ser Lys Lys Met Glu 
860 



Thr Met Gly lie Tyr Phe Ala Thr Pro Glu Trp Val Ala Leu Asn Gly 
865 870 875 880 



His Arg Gly Pro Ser Pro Gly Gin Leu Lys Tyr Trp Gin Asn Thr Arg 

885 890 895 



Glu lie Pro Glu Pro Asn Glu Asp Tyr Pro Asp Tyr Val His Ala Glu 

900 905 910 
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Lys Ser Arg Leu Ala Ser Glu Glu Gin lie Leu Arg Ala Ala Thr Ser 

915 920 925 

lie Tyr Gly Ala Pro Gly Gin Ala Glu Pro Pro Gin Ala Phe lie Asp 
930 935 940 

Glu Val Ala Arg Val Tyr Glu lie Asn His Gly Arg Gly Pro Asn Gin 
945 950 955 960 

Glu Gin Met Lys Asp Leu Leu Leu Thr Ala Met Glu Met Lys His Arg 

965 970 975 

Asn Pro Arg Arg Ala Pro Pro Lys Pro Lys Pro Lys Pro Asn Ala Pro 

980 985 990 

Ser Gin Arg Pro Pro Gly Arg Leu Gly Arg Trp He Arg Thr Val Ser 
995 1000 1005 

Asp Glu Asp Leu Glu 
1010 
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Claims 

1 . A method for preparing live Bimavirus, comprising the following 

steps: 

preparing a cDNA containing infectious bursal disease virus genome 
segments A and B, 

transcribing said cDNA to produce synthetic RNA transcripts, 

■ 

transfecting host cells with said synthetic RNA transcripts, 

incubating said host cells in a culture medium, and 

isolating live infectious bursal disease virus from said culture medium. 

2. The method according to claim 1 , wherein said Bimavirus is 
infectious bursal disease virus. 

3. The method according to claim 1, wherein said host cells are African 
green monkey Vera cells. 

4. The method according to claim 1, wherein said segments A and B of 
said cDNA are independently prepared. 

5. The method according to claim 4, wherein said segment A is present 
in plasmid pUC19FLAD78 or pUC18FLA23. 

6. The method according to claim 4, wherein said segment B is present 
in plasmid pUC18FLBP2. 

7. A live infectious bursal disease virus, wherein said virus is made by 
a process comprising the steps of preparing a cDNA containing infectious 
bursal disease virus genome segments A and B, 

transcribing said cDNA to produce a synthetic RNA transcript, 
transfecting a host cell with said synthetic RNA transcript, 
incubating said host cell in a culture medium, and 
isolating live infectious bursal disease virus from said culture medium. 

8. A synthetic RNA encoding proteins VP1, VP2, VP3, VP4, and VP5 
of infectious bursal disease virus. 

9. A host cell transfected with the synthetic RNA according to claim 8. 

10. A cDNA containing at least a portion of the infectious bursal 
disease virus genome selected from the group consisting of segment A, 
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segment B and segments A and B of infectious bursal disease virus, wherein 
said cDNA includes the 5' and 3' terminii of said segments. 

1 1 . A recombinant vector comprising the cDNA according to claim 10. 

12. The vector according to claim 1 1 , wherein said vector is a plasmid. 

13. The vector according to claim 12, wherein said plasmid is selected 
from the group consisting of pUC19FLAD78, pUC18FLA23 and 
pUC19FLBP2. 

14. A host cell transformed with the vector according to claim 1 1 . 

15. A vaccine comprising an infectious bursal disease virus according 
to claim 7, wherein said infectious bursal disease virus is inactivated or 
attenuated prior to administration. 

16. A method for producing a live infectious bursal disease virus 

vaccine, comprising the steps of 

preparing a full-length cDNA containing infectious bursal disease vims 

genome segments A and B, 

transcribing said cDNA to produce synthetic RNA transcripts, 

purifying said synthetic RNA transcripts, 

transfecting host cells with said purified RNA transcripts, 

incubating said host cells in a culture medium, 

isolating live infectious bursal disease virus from said culture medium, 

attenuating said live infectious bursal disease virus to produce a virus 
with reduced virulence, and 

combining said live infectious bursal disease virus with a 
pharmaceutical^ acceptable carrier to produce a live infectious bursal 

disease virus vaccine. 

17. The method according to claim 16, wherein said live infectious 
bursal disease vims is attenuated by serial passage or site directed 
mutagenesis. 

18. The method according to claim 1, wherein said host cells are 
poultry cells. 

19. The method according to claim 18, wherein said poultry cells are 
chicken, turkey, or quail cells. 
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20. The method according to claim 19, wherein said poultry cells are 
chicken embryo fibroblast cells or chicken embryo kidney cells. 
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